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INSECTICIDAL PROTEINS FROM BACILLUS THURINGIENSIS 

INTRODUCTION 

The present invention relates to new DNA sequences encoding insecticidal proteins produced 
by Bacillus thuringiensis strains. Particularly, new DNA sequences encoding proteins designated 
as Cry9Fa, Cryljd, and CrylBf are provided which are useful to protect plants from insect 
damage. Also included herein are micro-organisms and plants transformed with at least one of 
the newly isolated genes so that they are useful to confer insect resistance by expression of 
insecticidal protein. 

BACKGROUND OF THE INVENTION 

(i) Field of the Invention: 

Bt or Bacillus thuringiensis is well known for its specific toxicity to insect pests, and has been 
used since almost a century to control insect pests of plants. In more recent years, transgenic 
plants expressing Bt proteins were made which were found to successfully control insect 
damage on plants (e.g., Vaeck et al., 1987). 

Despite the isolation of a number of Bt crystal protein genes, the search for new genes 
encoding insecticidal proteins continues. Indeed, insecticidal Bt crystal proteins are known to 
have a relatively narrow target insect range compared to chemical insecticides. Also, having 
multiple toxins active on the same target insect species allows the use of proteins having 
different modes of action so that insect resistance development can be prevented or delayed. 

(ii) Description of Related Art: 

Previously, several types of Cry1 B-, Cry1 J-, and Cry9-proteins were identified (see Crickmore 
et al., 1998, incorporated herein by reference, for all details). 
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The new Cry1 Bf protein has the closest sequence identity with the Cry1 Be protein (Payne et 
al, 1998, US Patent 5,723,758), but still differs in about 14 percent of the amino acid 
sequence of its toxic protein fragment with the toxic fragment of the Cryl Be protein. 
The closest sequence identity with the Cryljd toxic fragment was found in the toxic fragment 
of the Cryljd protein (US Patent 5,723,758), but the toxic fragments of both proteins still 
differ in about 18 % of their amino acid sequence. 

The closest sequence identity with the Cry9Fa toxic fragment was found with the toxic 
fragment of the Cry9Ea1 protein as described by Midoh et al. (PCT Patent publication WO 
98/26073) and Narva et al. (PCT patent publication WO 98/00546), but the toxic fragments of 
the Cry9Fa and Cry9Ea proteins still differ in about 21 % of their amino acid sequence. 

SUMMARY OF THE INVENTION 

In accordance with this invention is provided a DNA sequence encoding a protein comprising 
the amino acid sequence selected from the group consisting of: a) the amino acid sequence of 
the insecticidal trypsin-digestion fragment of the protein encoded by the crylBf gene 
deposited at the BCCM-LMBP under accession number LMBP 3986, b) the amino acid 
sequence of the insecticidal trypsin-digestion fragment of the protein encoded by the cryljd 
gene deposited at the BCCM-LMBP under accession number LMBP 3983, and c) the amino 
acid sequence of the insecticidal trypsin-digestion fragment of the protein encoded by the 
cry9Fa gene deposited at the BCCM-LMBP under accession number LMBP 3984. 

Particularly preferred in accordance with this invention is a DNA sequence encoding a protein 
comprising the amino acid sequence selected from the group consisting of: the amino acid 
sequence of an insecticidal fragment of the protein of SEQ ID No. 2, the amino acid sequence 
of an insecticidal fragment of the protein of SEQ ID No. 4, and the amino acid sequence of an 
insecticidal fragment of the protein of SEQ ID No. 6; alternatively, a DNA encoding a protein 
comprising the amino acid sequence of the group selected from: the amino acid sequence of 
SEQ ID No. 2, the amino acid sequence of SEQ ID No. 4, the amino acid sequence of SEQ ID 
No. 6; or a DNA sequence comprising the DNA sequence of SEQ ID No. 1, SEQ ID No. 3, or 
SEQ ID No. 5.. 
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Further, in accordance with this invention are provided DNA sequences encoding at least the 
following portions of the newly-isolated proteins: the amino acid sequence of SEQ ID No. 2 
from amino acid position 1 to amino acid position 640, the amino acid sequence of SEQ ID 
No. 4 from amino acid position 1 to amino acid position 596, and the amino acid sequence of 
SEQ ID No. 6 from amino acid position 1 to amino acid position 652. 

Further, in accordance with this invention are provided the above DNA sequences comprising 
an artificial DNA sequence having a different codon usage compared to the naturally 
occurring DNA sequence but encoding the same protein or its insecticidal fragment. 

Even further provided in accordance with this invention is a protein comprising the amino 
acid sequence selected from the group consisting of: a) the amino acid sequence of the 
insecticidal trypsin-digestion fragment of the protein encoded by the crylBf gene deposited at 
the BCCM-LMBP under accession number LMBP 3986, b) the amino acid sequence of the 
insecticidal trypsin-digestion fragment of the protein encoded by the cryljd gene deposited at 
the BCCM-LMBP under accession number LMBP 3983, and c) the amino acid sequence of 
the insecticidal trypsin-digestion fragment of the protein encoded by the cry9Fa gene 
deposited at the BCCM-LMBP under accession number LMBP 3984. 

Particularly preferred herein is a protein comprising the amino acid sequence selected from 
the group consisting of: the amino acid sequence of an insecticidal fragment of the protein of 
SEQ ID No. 2, the amino acid sequence of an insecticidal fragment of the protein of SEQ ID 
No. 4, and the amino acid sequence of an insecticidal fragment of the protein of SEQ ID No. 
6; alternatively a protein, comprising the amino acid sequence selected from the group 
consisting of: the amino acid sequence of SEQ ID No. 2 from amino acid position 1 to amino 
acid position 640, the amino acid sequence of SEQ ID No. 4 from amino acid position 1 to 
amino acid position 596, and the amino acid sequence of SEQ ID No. 6 from amino acid 
position 1 to amino acid position 652; or a protein comprising the amino acid sequence of 
SEQ ID No. 2, SEQ ID No. 4, or SEQ ID No. 6. 
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Also provided herein are chimeric genes comprising the DNA as defined above under the 
control of a plant-expressible promoter, and plant cells, plants or seeds transformed to contain 
those chimeric genes, particularly plant cells, plants, or seeds selected from the group 
consisting of: corn, cotton, rice, oilseed rape, Brassica species, eggplant, soybean, potato, 
sunflower, tomato, sugarcane, tea, beans, tobacco, strawberry, clover, cucumber, watermelon, 
pepper, oat, barley, wheat, dahlia, gladiolus, chrysanthemum, sugarbeet, sorghum, alfalfa, and 
peanut. In accordance with this invention, the chimeric gene can be integrated in the nuclear 
or chloroplast DNA of the plant cells. 

Further in accordance with this invention are provided micro-organisms, transformed to 
contain any of the above DNA sequences, particularly those selected from the genus 
Agrobacterium, Escherichia, or Bacillus. 

Also provided herein is a process for controlling insects, comprising expressing any of the 
above DNA sequences in a host cell, particularly plant cells, and contacting insects with said 
host cells, and a process for rendering a plant resistant to insects, comprising transforming 
plants cells with any of the above DNA sequences or chimeric genes, and regenerating 
transformed plants from such cells which are resistant to insects. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

In accordance with this invention, DNA sequences encoding new Bt toxins have been isolated 
and characterized. The new genes were designated crylBf, cryljd and cry9Fa, and their 
encoded proteins CrylBf, Cryljd and Cry9Fa. 

In accordance with this invention "Cry1 Bf protein" refers to any protein comprising the 
smallest protein fragment of the amino acid sequence of SEQ ID No. 2 which retains 
insecticidal activity, particularly any protein comprising the amino acid sequence from the 
amino acid at position 1 to the amino acid at position 640 in SEQ ID No. 2, including but not 
limited to the complete protein with the amino acid sequence of SEQ ID No. 2. This includes 
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hybrids or chimeric proteins comprising the smallest toxic protein fragment, as well as proteins 
containing at least one of the three functional domains of the toxic fragment of SEQ ID No. 2. 
The term "DNA/protein comprising the sequence X", as used herein, refers to a DNA or 
protein including or containing at least the sequence X, so that other nucleotide or amino 
acid sequences can be included at the 5' (or N-tertninal) and/or 3' (or C-terminal) end, e.g. 
(the nucleotide sequence of) a selectable marker protein as disclosed in EP 0 193 259. 

In accordance with this invention, "Cry9Fa protein" or "Cry9F protein" refers to any protein 
comprising the smallest protein fragment of the amino acid sequence of SEQ ID No. 6 which 
retains insecticidal activity, particularly any protein comprising the amino acid sequence from 
the amino acid at position 1 to the amino acid at position 652 in SEQ ID No. 6, including but 
not limited to the complete protein with the amino acid sequence of SEQ ID No. 6. This 
includes hybrids or chimeric proteins comprising the smallest toxic protein fragment, as well as 
proteins containing at least one of the three functional domains of the toxic fragment of SEQ 
ID No. 6. 

In accordance with this invention, "Cryljd protein" refers to any protein comprising the 
smallest protein fragment of the amino acid sequence of SEQ ID No. 4 which retains 
insecticidal activity, particularly any protein comprising the amino acid sequence from the 
amino acid at position 1 to the amino acid at position 596 in SEQ ID No. 4, including but not 
limited to the complete protein with the amino acid sequence of SEQ ID No. 4. This includes 
hybrids or chimeric proteins comprising the smallest toxic protein fragment, as well as proteins 
containing at least one of the three functional domains of the toxic fragment of SEQ ID No. 4. 

As used herein, the terms "ay7B/*DNA", "cry9Fa DNA", or "cryljd DNA", refer to any DNA 
sequence encoding the Cry1 Bf, Cry9Fa, or Cryljd protein, respectively, as defined above. 
This includes naturally occurring, artificial or synthetic DNA sequences encoding the newly 
isolated proteins or their insecticidal fragments as defined above. Also included herein are 
DNA sequences encoding insecticidal proteins which are similar enough to the coding regions 
of the genomic DNA sequences deposited or the sequences provided in the sequence listing 
so that they can (i.e., have the ability to) hybridize to these DNA sequences under stringent 
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hybridization conditions. Stringent hybridization conditions, as used herein, refers particularly 
to the following conditions: immobilizing the relevant genomic DNA sequences on a filter, 
and prehybridizing the filters for either 1 to 2 hours in 50 % formamide, 5 % SSPE, 2x 
Denhardt's reagent and 0.1 % SDS at 42 0 C or 1 to 2 hours in 6x SSC, 2xDenhardt's reagent 
and 0.1 % SDS at 68 °C. The denatured labeled probe is then added directly to the 
prehybridization fluid and incubation is carried out for 16 to 24 hours at the appropriate 
temperature mentioned above. After incubation, the filters are then washed for 20 minutes at 
room temperature in 1x SSC, 0.1 % SDS, followed by three washes of 20 minutes each at 68 
°C in 0.2 x SSC and 0.1 % SDS. An autoradiograph is established by exposing the filters for 24 
to 48 hours to X-ray film (Kodak XAR-2 or equivalent) at -70 °C with an intensifying screen. Of 
course, equivalent conditions and parameters can be used in this process while still retaining 
the desired stringent hybridization conditions. One of such equivalent conditions includes: 
immobilizing the relevant genomic DNA sequences on a filter, and prehybridizing the filters 
for either 1 to 2 hours in 50 % formamide, 5 % SSPE, 2x Denhardt's reagent and 0.1 % SDS at 
42 ° C or 1 to 2 hours in 6x SSC, 2xDenhardt's reagent and 0.1 % SDS at 68 °C. The 
denatured (dig- or radiolabeled probe is then added directly to the prehybridization fluid 
and incubation is carried out for 16 to 24 hours at the appropriate temperature mentioned 
above. After incubation, the filters are then washed for 30 minutes at room temperature in 2x 
SSC, 0.1 % SDS, followed by 2 washes of 30 minutes each at 68 °C in 0.5 x SSC and 0.1 % 
SDS. An autoradiograph is established by exposing the filters for 24 to 48 hours to X-ray film 
(Kodak XAR-2 or equivalent) at -70 °C with an intensifying screen 

"Insecticidal activity" of a protein, as used herein, means the capacity of a protein to kill 
insects when such protein is fed to insects, preferably by expression in a recombinant host 
such as a plant. "Insect-controlling amounts" of a protein, as used herein, refers to an amount 
of protein which is sufficient to limit damage on a plant by insects feeding on such plant to 
commercially acceptable levels, e.g. by killing the insects or by inhibiting the insect 
development or growth in such a manner that they provide less damage to a plant and plant 
yield is not significantly adversely affected. 

In accordance with this invention, insects susceptible to the new Cry proteins of the invention 
are contacted with this protein in insect-controlling amounts, preferably insecticidal amounts. 
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"Cry protein" or "Cry protein of this invention", as used herein, refers to any one of the new 
proteins isolated in accordance with this invention and identified herein as Cryl Bf, Cry9Fa, or 
CryUd protein. A Cry protein, as used herein, can be a protein in the full length size, also 
named a protoxin, or can be in a slightly or fully (e.g., N- and C-terminal truncation) truncated 
form as long as the insecticidal activity is retained, or can be a combination of different 
proteins or protein parts in a hybrid or fusion protein. A "Cry protoxin" refers to the full 
length crystal protein as it is encoded by the naturally-occurring Bt DNA sequence, a "Cry 
toxin" refers to an insecticidal fragment thereof, particularly the smallest toxic fragment 
thereof, typically in the molecular weight range of about 60 to about 80 kD as determined by 
SDS-PAGE electrophoresis. A "cry gene" or "cry DNA", as used herein, is a DNA sequence 
encoding a Cry protein in accordance with this invention, referring to any of the crylBf, 
cry9Fa, and cryljd DNA sequences defined above. 

The "smallest toxic fragment" of a Cry protein, as used herein, is that fragment as can be 
obtained by trypsin or chymotrypsin digestion of the full length solubilized crystal protein that 
retains toxicity, or that toxic protein fragment encoded by DNA fragments of the Cry protein. 
This protein will mostly have a short N-terminal and a long C-terminal truncation compared to 
the protoxin. Although for recombinant expression, toxic fragments starting at or around 
original amino acid position 1 are a more preferred embodiment in accordance with this 
invention, it should be noted that besides a C-terminal truncation, some N-terminal amino 
acids can also be deleted while retaining the insecticidal character of the protein. The N- 
terminal end of the smallest toxic fragment is conveniently determined by N-terminal amino 
acid sequence determination of trypsin- or chymotrypsin-treated soluble crystal protein by 
techniques routinely available in the art. 

Dna encoding the Cry proteins of this invention can be isolated in a conventional manner 
from the E coli strains, deposited on November 25, 1999 at the BCCM-LMBP under accession 
numbers LMBP 3983, LMBP 3984, LMBP 3985 and LMBP 3986. The encoded Cry proteins can 
be used to prepare specific monoclonal or polyclonal antibodies in a conventional manner (Hofte 
et al., 1988). The toxin forms can be obtained by protease (e.g., trypsin) digestion of the Cry 
protoxins. 
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The DNA sequences encoding the Cry proteins can be isolated in a conventional manner 
from the respective strains or can be synthesized based on the encoded amino acid sequence. 

The DNA sequences encoding the Cry proteins of the invention were identified by 
digesting total DNA from isolated Bt strains with restriction enzymes; size fractionating the DNA 
fragments, so produced, into DNA fractions of 5 to 10 Kb; ligating these fractions to cloning 
vectors; screening the £ co//, transformed with the cloning vectors, with a DNA probe that was 
constructed from a region of known Bt crystal protein genes or with a DNA probe based on 
specific PCR fragments generated from Bt DNA using primers corresponding to known Bt crystal 
protein genes. 

Also, DNA sequences for use in this invention can be made synthetically. Indeed, 
because of the degeneracy of the genetic code, some amino acid codons can be replaced with 
others without changing the amino acid sequence of the protein. Furthermore, some amino acids 
can be substituted by other equivalent amino acids without significantly changing the insecticidal 
activity of the protein. Also, changes in amino acid sequence or composition in regions of the 
molecule, different from those responsible for binding and toxicity (e.g., pore formation) are less 
likely to cause a difference in insecticidal activity of the protein. Such equivalents of the gene 
include DNA sequences hybridizing to the DNA sequence of the Cry toxins or protoxins of SEQ 
ID. No. 2, 4, or 6 under stringent conditions and encoding a protein with the same insecticidal 
characteristics as the (pro)toxin of this invention, or DNA sequences encoding proteins with an 
amino acid sequence identity of at least 85 %, preferably at least 90 % most preferably at least 
95 %, with the protein toxin form (from the N-terminus to 2 amino acids beyond conserved 
sequence block 5 as defined in Schnepf et al., 1998) or with the protein protoxin form of the 
Cry1 Bf, Cry9FAor Cryljd proteins of this invention, as determined using the CAP program of the 
Wisconsin package of GCG (Madison, Wisconsin, USA, version 10.0; CCC defaults were used 
within the CAP program; for the amino acid sequence comparisons, the blosum62 scoring matrix 
was used). 

Of course, any other DNA sequence differing in its codon usage but encoding the same 
protein or a similar protein with substantially the same insecticidal activity, can be constructed, 
depending on the particular purpose. It has been described in prokaryotic and eucaryotic 
expression systems that changing the codon usage to that of the host cell is desired for gene 



9 



expression in foreign hosts (Bennetzen & Hall, 1982; Itakura, 1977). Furthermore, Bt crystal 
protein genes are known to have no bias towards eucaryotic codons, and to be very AT-rich 
(Adang et al., 1985, Schnepf et aL, 1985). Codon usage tables are available in the literature 
(Wada et al., 1 990; Murray et al., 1 989) and in the major DNA sequence databanks (e.g. EMBL 
at Heidelberg, Germany). Accordingly, synthetic DNA sequences can be constructed so that the 
same or substantially the same proteins are produced. It is evident that several DNA sequences 
can be devised once the amino acid sequence of the Cry proteins of this invention is known. 
Such other DNA sequences include synthetic or semi-synthetic DNA sequences that have been 
changed in order to inactivate certain sites in the gene, e.g. by selectively inactivating certain 
cryptic regulatory or processing elements present in the native sequence as described in PCT 
publications WO 91/16432 and WO 93/09218, or by adapting the overall codon usage to that 
of a more related host organism, preferably that of the host organism in which expression is 
desired. When making such genes, the encoded amino acid sequence should be retained to the 
maximum extent possible, although truncations or minor replacements or additions of amino 
acids can be done as long as the toxicity of the protein is not negatively affected. 

Small modifications to a DNA sequence such as described above can be routinely made 
by PCR-mediated mutagenesis (Ho et al.,1 989, White et al., 1 989). 

With the term "substantially the same", when referring to a protein, is meant to include 
a protein that differs in some amino acids, or has some amino acids added (e.g. a fusion protein, 
see Vaeck et al., 1 987) or deleted (e.g. N- or C-terminal truncation), as long as the protein has no 
major difference in its insecticidal activity. 

The term "functional domain" of a Cry toxin as used herein means any part(s) or domain(s) 
of the toxin with a specific structure that can be transferred to another (Cry) protein for providing 
a new hybrid protein with at least one functional characteristic (e.g., the binding and/or toxicity 
characteristics) of the Cry toxin of the invention (Ge et al., 1991). Such parts can form an essential 
feature of the hybrid Bt protein with the binding and/or toxicity characteristics of the Cry protein 
of this invention. Such a hybrid protein can have an enlarged host range, an improved toxicity 
and/or can be used in a strategy to prevent insect resistance development (European Patent 
Publication ("EP") 408 403; Visser et al., 1993). 

The 5 to 10 Kb fragments, prepared from total DNA of the Bt strains of the invention, can 
be ligated in suitable expression vectors and transformed in £. co//, and the clones can then be 
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screened by conventional colony immunoprobing methods (French etal., 1986) for expression 
of the toxin with monoclonal or polyclonal antibodies raised against the Cry proteins, or by 
hybridization with DNA probes. 

Also/the 5 to 10 Kb fragments, prepared from total DNA of the Bt strains of the invention 
or fragments thereof cloned and/or subcloned in E.coli, can be ligated in suitable Bt shuttle vectors 
(Lereclus et al., 1992) and transformed in a crystal minus Bt-mutant. The clones are then 
screened for production of crystals (detected by microscopy) or crystal proteins (detected by SDS- 
PAGE). 

The genes encoding the Cry proteins of this invention can be sequenced in a conventional 
manner (Maxam and Gilbert, 1980; Sanger, 1977) to obtain the DNA sequence. Sequence 
comparisons indicated that the genes are different from previously described genes encoding 
protoxins and toxins with activity against Lepidoptera (Hofte and Whiteley, 1 989; Crickmore, et 
al., 1998); and the December 15, 1999 and October 16, 2000 updates on the Bt nomenclature 
website corresponding to the Crickmore et al. (1998) publication, found at: 

http://epunix.biols.susx.ac.uk/Home/Neil__Crickmore/Bt/index.html 

An insecticidally effective part of the DNA sequences, encoding an insecticidally effective 
portion of the newly identified Cry protein protoxin forms, can be made in a conventional manner 
after sequence analysis of the gene. In such fragments, it is preferred that at least the sequence 
up to the C-terminal end of conserved sequence block 5 of Bt proteins (Hofte & Whiteley, 1 989; 
Schnepf et al., 1998), preferably up to two amino acids C-terminal of the conserved sequence 
block 5, is retained. The amino acid sequence of the Cry proteins can be determined from the 
DNA sequence of the isolated DNA sequences. By "an insecticidally effective part" of DNA 
sequences encoding the Cry protein, also referred to herein as "truncated gene" or "truncated 
DNA", is meant a DNA sequence encoding a polypeptide which has fewer amino acids than the 
Cry protein protoxin form but which is insecticidal to insects. 

In order to express all or an insecticidally effective part of the DNA sequence encoding 
a Cry protein of this invention in f. co//, in other Bt strains and in plants, suitable restriction sites 
can be introduced, flanking the DNA sequence. This can be done by site-directed mutagenesis, 
using well-known procedures (Stanssens et al., 1989; White et al., 1989). In order to obtain 
improved expression in plants, the codon usage of the cry gene or insecticidally effective cry gene 
part of this invention can be modified to form an equivalent, modified or artificial gene or gene 
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part in accordance with PCT publications WO 91/16432 and WO 93/09218; EP 0 358 962 and 
EP 0 359 472, or the Bt genes or gene parts can be inserted in the chloroplast genome and 
expressed there using a chloropast-active promoter (e.g., Mc Bride et al., 1995). For obtaining 
enhanced expression in monocot plants such as corn, a monocot intron also can be added to the 
chimeric gene, and the DNA sequence of the cry gene or its insecticidal part of this invention can 
be further changed in a translationally neutral manner, to modify possibly inhibiting DNA 
sequences present in the gene part by means of site-directed intron insertion and/or by 
introducing changes to the codon usage, e.g., adapting the codon usage to that most preferred 
by the specific plant (Murray et al., 1 989) without changing significantly the encoded amino acid 
sequence. 

Furthermore, the binding properties of the Cry proteins of the invention can be evaluated, 
using methods known in the art (Van Rie et al., 1990), to determine if the Cry proteins of the 
invention bind to sites on the insect midgut that are different from those recognized by other, 
known Cry or other Bt proteins. Bt toxins with different binding sites in relevant susceptible 
insects are very valuable to replace known Bt toxins to which insects may have developed 
resistance, or to use in combination with Bt toxins having a different mode of action to prevent 
or delay the development of insect resistance against Bt toxins, particularly when expressed in a 
plant. Because of the characteristics of the newly isolated Bt toxins, they are extremely useful for 
transforming plants, e.g. monocots such as corn or rice and vegetables such as Brassica species 
plants, to protect these plants from insect damage. 

The insecticidally effective cry gene part or its equivalent, preferably the cry chimeric 
gene, encoding an insecticidally effective portion of the Cry protoxin, can be stably inserted in 
a conventional manner into the nuclear genome of a single plant cell, and the so-transformed 
plant cell can be used in a conventional manner to produce a transformed plant that is 
insect-resistant. In this regard, a disarmed Ti-plasmid, containing the insecticidally effective cry 
gene part, in Agrobacterium tumefaciens can be used to transform the plant cell, and thereafter, 
a transformed plant can be regenerated from the transformed plant cell using the procedures 
described, for example, in EP 0 116 718, EP 0 270 822, PCT publication WO 84/02913 and 
published European Patent application ("EP") 0 242 246 and in Gould et al. (1991). Preferred Ti- 
plasmid vectors each contain the insecticidally effective cry gene part between the border 
sequences, or at least located to the left of the right border sequence, of the T-DNA of the Ti- 
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plasmid. Of course, other types of vectors can be used to transform the plant cell, using 
procedures such as direct gene transfer (as described, for example in EP 0 233 247), pollen 
mediated transformation (as described, for example in EP 0 270 356, PCT publication WO 
85/01856, and US Patent 4,684,61 1 ), plant RNA virus-mediated transformation (as described, for 
example in EP 0 067 553 and US Patent 4,407,956), liposome-mediated transformation (as 
described, for example in US Patent 4,536,475), and other methods such as the recently 
described methods for transforming certain lines of corn (Fromm et al., 1990; Gordon-Kamm et 
ah, 1990) and rice (Shimamoto et aL, 1989; Datta et al., 1990) and the recently described 
method for transforming monocots generally (PCT publication WO 92/09696). 

The resulting transformed plant can be used in a conventional plant breeding scheme to 
produce more transformed plants with the same characteristics or to introduce the insecticidally 
effective cry gene part in other varieties of the same or related plant species. Seeds, which are 
obtained from the transformed plants, contain the insecticidally effective cry gene part as a stable 
genomic insert. Cells of the transformed plant can be cultured in a conventional manner to 
produce the insecticidally effective portion of the Cry protoxin, preferably the Cry toxin, which 
can be recovered for use in conventional insecticide compositions against Lepidoptera (US Patent 
5,254,799). In accordance with this invention, plants or seeds of the invention can be used to 
obtain resistance to insects, e.g. by sowing or planting in a field wherein damaging insects usually 
occur, said seeds or plants. Methods for obtaining insect resistance and methods for obtaining 
improved yield or reduced insect damage are thus provided in accordance with the invention by 
planting or sowing in a field, preferably a field wherein damaging insects feeding on such plants 
usually occur or are expected to occur at levels which provide economic damage to the plants, 
the plants of seeds of the invention producing the Cry proteins of the invention. 

The insecticidally effective cry gene part, preferably the truncated cry gene, is inserted in 
a plant cell genome so that the inserted gene is downstream (i.e., 3') of, and under the control of, 
a promoter which can direct the expression of the gene part in the plant cell. This is preferably 
accomplished by inserting the cry chimeric gene in the plant cell genome, particularly in the 
nuclear or chloroplast genome. Preferred promoters include: the strong constitutive 35S 
promoters (the "35S promoters") of the cauliflower mosaic virus (CaMV) of isolates CM 1841 
(Gardner et al., 1981), CabbB-S (Franck et al., 1980) and CabbB-JI (Hull and Howell, 1987); 
promoters from the ubiquitin family (e.g., the maize ubiquitin promoter of Christensen 
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et al., 1 992, see also Cornejo et al., 1 993), the gos2 promoter (de Pater et al., 1 992), the 
emu promoter (Last et al., 1 990), rice actin promoters such as the promoter described by 
Zhang et al. (1991); and the TR1' promoter and the TR2' promoter (the 'TRT promoter" and 
'TR2' promoter", respectively) which drive the expression of the 1' and 2' genes, respectively, of 
the T-DNA (Velten et al., 1 984). Alternatively, a promoter can be utilized which is not constitutive 
but rather is specific for one or more tissues or organs of the plant (e.g., leaves and/or roots) 
whereby the inserted cry gene part is expressed only in cells of the specific tissue(s) or organ(s). 
For example, the insecticidally effective cry gene part could be selectively expressed in the leaves 
of a plant (e.g., com, cotton) by placing the insecticidally effective gene part under the control of 
a light-inducible promoter such as the promoter of the ribulose-1 ,5-bisphosphate carboxylase 
small subunit gene of the plant itself or of another plant such as pea as disclosed in US Patent 
5,254,799. Another alternative is to use a promoter whose expression is inducible (e.g., by 
temperature, wounding or chemical factors). 

The insecticidally effective cry gene part is inserted in the plant genome so that the 
inserted gene part is upstream (i.e., 5') of suitable 3' end transcription regulation signals (i.e., 
transcript formation and polyadenylation signals). This is preferably accomplished by inserting the 
cry chimeric gene in the plant cell genome. Preferred polyadenylation and transcript formation 
signals include those of the octopine synthase gene (Gielen et al., 1984) and the T-DNA gene 7 
(Velten and Schell, 1985), which act as 3'-untranslated DNA sequences in transformed plant cells. 

The insecticidally effective cry gene part can optionally be inserted in the plant genome 
as a hybrid gene (US Patent 5,254,799; Vaeck et al., 1987) under the control of the same 
promoter as a selectable marker gene, such as the neo gene (EP 0 242 236) encoding kanamycin 
resistance, so that the plant expresses a fusion protein. 

All or part of the cry gene, encoding an anti-lepidopteran protein, can also be used to 
transform other bacteria, such as a B. thuringiensis which has insecticidal activity against 
Lepidoptera or Coleoptera. Thereby, a transformed Bt strain can be produced which is useful for 
combatting a wide spectrum of lepidopteran and coleopteran insect pests or for combatting 
additional lepidopteran insect pests. Transformation of bacteria, such as bacteria of the genus 
Agrobacterium, Bacillus or Escherichia, with all or part of the cry gene of this invention, 
incorporated in a suitable cloning vehicle, can be carried out in a conventional manner, preferably 
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using conventional electroporation techniques as described in Mahillon et al. (1 989) and in PCT 
Patent publication WO 90/06999. 

Transformed Bac/7/us species strains containing the cry gene of this invention can be 
fermented by conventional methods (Dulmage, 1981; Bernhard and Utz, 1993) to provide high 
yields of cells. Under appropriate conditions which are well understood (Dulmage, 1 981 ), these 
strains each sporulate to produce crystal proteins containing the Cry protoxin in high yields. 

An insecticidal, particularly anti-lepidopteran, composition of this invention can be 
formulated in a conventional manner using the microorganisms transformed with the cry gene, 
or preferably their respective Cry proteins or the Cry protoxin, toxin or insecticidally effective 
protoxin portion as an active ingredient, together with suitable carriers, diluents, emulsifiers 
and/or dispersants (e.g., as described by Bernhard and Utz, 1993). This insecticide composition 
can be formulated as a wettable powder, pellets, granules or dust or as a liquid formulation with 
aqueous or non-aqueous solvents as a foam, gel, suspension, concentrate, etc. 

A method for controlling insects, particularly Lepidoptera, in accordance with this 
invention can comprise applying (e.g., spraying), to a locus (area) to be protected, an insecticidal 
amount of the Cry proteins or host cells transformed with the cry gene of this invention. The locus 
to be protected can include, for example, the habitat of the insect pests or growing vegetation or 
an area where vegetation is to be grown. 

To obtain the Cry protoxin or toxin, cells of the recombinant hosts expressing the Cry 
protein can be grown in a conventional manner on a suitable culture medium and then lysed 
using conventional means such as enzymatic degradation or detergents or the like. The protoxin 
can then be separated and purified by standard techniques such as chromatography, extraction, 
electrophoresis, or the like. The toxin can then be obtained by trypsin digestion of the protoxin. 

The following Examples illustrate the invention, and are not provided to limit the invention 
or the protection sought. The sequence listing referred to in the Examples, the Claims and the 
Description is as follows: 

Sequence Listing: 

SEQ ID No. 1 - amino acid and DNA sequence of Cry1 Bf protein and DNA 
SEQ ID No. 2 - amino acid sequence of Cry1 Bf protein. 
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SEQ ID No. 3 - amino acid and DNA sequence of CryUd protein and DNA. 

SEQ ID No. 4 - amino acid sequence Cryljd protein. 

SEQ ID No. 5 - amino acid and DNA sequence of Cry9Fa protein and DNA. 

SEQ ID No. 6 - amino acid sequence of Cry9Fa protein. 

SEQ ID No. 7 - DNA sequence for primer Cryl B.fw. 

SEQ ID No. 8 - DNA sequence for primer B.R. 

SEQ ID No. 9 - DNA sequence for primer B.F. 

SEQ ID No. 10 - DNA sequence for primer JFW. 

SEQ ID No. 1 1 - DNA sequence for primer JRV. 

SEQ ID No. 12- DNA sequence for primer 9FW. 

SEQ ID No. 1 3 - DNA sequence for primer 9RV. 

Unless otherwise stated in the Examples, all procedures for making and manipulating 
recombinant DNA are carried out by the standard procedures described in Sambrook et al., 
Molecular Cloning - A Laboratory Manual, Second Ed., Cold Spring Harbor Laboratory Press, 
NY (1 989), and in Volumes 1 and 2 of Ausubel et al. (1 994) Current Protocols in Molecular 
Biology, Current Protocols, USA. Standard materials and methods for plant molecular biology 
work are described in Plant Molecular Biology Labfax (1993) by R.R.D. Cray, joindy published 
by BIOS Scientific Publications Ltd (UK) and Blackwell Scientific Publications (UK). 
Procedures for PCR technology can be found in "PCR protocols: a guide to methods and 
applications", Edited by M.A. Innis, D.H. Gelfand, J.J. Sninsky and T.J. White (Academic 
Press, Inc., 1 990). 

EXAMPLES 

Fxample 1: Characteri zation of the strains. 

The BtS02072BG strain was isolated from a grain dust sample collected in Santo Tomas 
la Union, llocos, Philippines. The BtS02739C strain was isolated from a grain dust sample 
collected in Lucena City, South Tagalog, Philippines. 

Each strain can be cultivated on conventional standard media, preferably T 3 medium 
(tryptone 3 g/l, tryptose 2 g/l, yeast extract 1 .5 g/l, 5 mg MnCl 2 , 0.05 M Na 2 HP0 4 .2H 2 0, 0.05 M 
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NaH 2 PO 4 .H 2 0, pH 6.8 and 1 .5% agar), preferably at 28 °C. For long term storage, it is preferred 
to mix an equal volume of a spore-crystal suspension with an equal volume of 50% glycerol and 
store this at -70 °C or lyophilize a spore-crystal suspension. For sporulation, growth on T 3 medium 
is preferred for 72 hours at 28 °C, followed by storage at 4 °C. The crystal proteins produced by 
the strains during sporulation are packaged in crystals. 

Fvam ple 2 t Inserticidal activity of the BtS fl2072BG and BtS02739C strains agaipst selected 
lepidopteran insect species. 

Toxicity assays were performed on neonate larvae of Helicoverpa zea, Heliothis virescens, 
Ostrinia nubilaiis, Spodoptera frugiperda and Sesamia nonagrioides fed on an artificial diet layered 
with spore-crystal mixtures from either BtS02072BG or BtS02739C, at about 10 9 spore-crystals 
per ml. 

The artificial diet (Vanderzant, 1962) was dispensed in wells of Costar 24-well plates for tests on 
H. zea, H. virescens and O. nubilaiis. 50 microliter of the spore-crystal mixture was applied on 
the surface of the diet and dried in a laminar air flow. For tests on H. zea, H. virescens, one larva 
was placed in each well and 20 larvae were used per sample. For tests on O. nubilaiis, 2 larvae 
were placed in each well and 24 larvae were used per sample. The artificial diet was dispensed 
in wells of Costar 48-well plates for tests on S. frugiperda and S. nonagrioides. 25 microliter of the 
spore-crystal mixture was applied on the surface of the diet and dried in a laminar air flow. One 
larva was placed in each well and 18 larvae were used per sample. Dead and living larvae were 
counted on the seventh day. The percentage of dead larvae are shown in Table I below. 



Table I: Percentage of dead larvae upon application of crystal-spore mixture to insects: 





BTS02072BC 


BTS02739C 








H. zea 


70 


15 


H. virescens 


85-50 


80-60 


O. nubilaiis 


92 


72 


S. frugiperda 


6 


Not tested 


S. nonagroides 


100 


Not tested 
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Example 3 : Characteri zation of new cry genes 



The BtS02739C genes were detected by PCR using degenerate primers targeting 
conserved regions in known cry genes. The resulting amplification product was purified using 
the Wizard PCR preps (Promega) purification system and ligated into pCEM-T vector (Promega). 
The ligation mixture was electroporated into E. coli JM101 . A miniprep was made of at least 40 
insert-containing transformants, and digests were performed with selected restriction enzymes. 
Following electrophoresis of the digested miniprep DNA, different DNA fragment patterns could 
be observed. For each pattern at least one colony was selected. An appropriate DNA prep was 
made in order to determine the sequence of the insert of the plasmid present in each selected 
colony. Alignment of the determined sequences of the amplification products with publicly 
available cry sequences demonstrates that strain BtS02739C contains a novel cry 1J- type gene and 
a novel cry9- type gene. 

The BtS02072BG gene was detected as follow. First, a PCR was performed using 
degenerate crystal protein gene primers on strain BtS02419J. The resulting amplification product 
was used as template in a secondary PCR using degenerate crystal protein primers 

The resulting amplification product was purified using the Wizard PCR preps (Promega) 
purification system and ligated into pGEM-T vector (Promega). The ligation mixture was 
electroporated into XL1 Blue £. coli. A miniprep was made of at least 40 insert-containing 
transformants, and digests were performed with selected restriction enzymes. Following 
electrophoresis of the digested miniprep DNA, different DNA fragment patterns could be 
observed. For each pattern at least one colony was selected. An appropriate DNA prep was 
made in order to determine the sequence of the insert of the plasmid present in each selected 
colony. 

From the cloned amplification products from strain BtS02419J, a sequence was found to 
be identical to the corresponding fragment of crylBel, except for one nucleotide difference. 
Next, primers were selected to evaluate the presence of a cry sequence similar to that of the 
sequenced cry gene fragment from BtS02419J in a number of Bt strains, one of them being strain 
BtS02072BG. These primers had the following sequence (5' to 3'): 

Forward primer: cryl B.fw: CAG TCC AAA CGC GTA TAA AC 
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Reverse primer: B.R: CTG CTT CGA AGG TTG CAG TA 

Alignment of the determined sequences from the amplification products with publicly 
available cry sequences demonstrates that strain BtS02072BG contains a novel c/ytS-type gene. 

Example 4 : Cloning and expres sion of the cry genes 

In order to isolate the full length crylj- type and c/y9- type gene from BtS02739C, and 
the crylB- type gene from BtS02072BG, total DNA from these strains was prepared and partially 
digested with Sau3A. The digested DNA was size fractionated on a sucrose gradient and 
fragments ranging from 5 Kb to 10 Kb were ligated to the Bam H1 -digested and TsAP 
(thermosensitive alkaline phosphatase)- treated cloning vector pUC19 (Yannisch-Perron et al, 
1985). The ligation mixture was electroporated in E. coli XL1-Blue or E. coli JM109 cells. 
Transformants were plated on LB-triacillin plates containing Xgal and IPTG and white colonies 
were selected to be used in filter hybridization experiments. Recombinant E.coli clones 
containing the vector were then screened with the appropriate DIG labeled probes. These probes 
were prepared as follows. First, a PCR was performed using as template cells from a recombinant 
E. coli clone containing a plasmid harboring the particular cry gene fragment, previously amplified 
using appropriate primers as shown in Table II. 



Table II: primers used to isolate novel Bt DNA sequences (Y= C or T, S= G or C): 



strain 


gene 


primer 


Length of 
amplified 
fragment 


Primer sequence 


2739C 


cry7/-type 


JFW 


365 bp 


CCA GCT AAT CCT ACC ACA TC 






JRV 




CTG CCG GTA TCC TGA CTA AT | 




cry9-type 


9FW 


576 


GYT TTT ATT CGC CCG CCA CA 






9RV 




CGA CAG TAG SAC CCA CTA CT 


2072 BC 


cry IB-type 


B.F 


922 


CAG CGT ATT AAG TCG ATG GA 






B.R 




CTG CTT CGA AGG TTG CAG TA 



The resulting amplification product was gel-purified and used as template in a secondary PCR 
reaction using DIG- labeled dNTPs. An appropriate amount of this amplification product was 
used in hybridization reactions. 
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Colony hybridization for strain BtS02739C was performed with a mixture of the cry! J- type probe 
and the cry9- type probe. Positive colonies were then hybridized with each probe separately. 
Colony hybridization for strain BtS02072BC was performed with the cryW- type probe. 
Following identification of a positive colony containing a plasmid harboring the full length cry 
gene, the sequence of the cry gene was determined using the dye terminator labeling method 
and a Perkin Elmer ABI Prism-377 DNA sequencer for both strands. Upon DNA sequencing, the 
genes were termed as follows: the crylj- type and cry9- type gene from BtS02739C were named 
cryljd and cry9Fa, respectively, and the crylB- type gene from BtS02072BG was named crylBf. 
The genomic sequences of the isolated cryljd, cry9Fa, and crylBf genes, as well as the proteins 
they encode, are shown in the Sequence Listing included in this application. Comparison of the 
sequences with known Cry DNA or protein sequences showed that the sequences are novel and 
differ in a substantial number of nucleotides or amino acids from known Bt genes and proteins. 
Tables III-V provide an overview of the sequence identity with respect to the coding regions of 
the most similar genes and proteins (both protoxin as toxin forms) as determined using the GAP 
program of the Wisconsin package of GCG (Madison, Wisconsin, USA) version 10.0. GCG 
defaults were used within the GAP program. For nucleic acid sequence comparisons, the 
nwsgapdna scoring matrix was used, for amino acid sequence comparisons, the blosum62 scoring 
matrix. The toxin form, as used in Tables lll-V, refers to the protein starting at the first amino acid 
and ending two amino acids beyond the last amino acid (usually a proline) of conserved sequence 
block 5, as defined in Schnepf et al. (1998). The protoxin form refers to the entire protein or 
coding region of the Bt protein/gene. 



Table III: Sequence identities for crylBflCrylBi: 



DNA 


crylBal 


crylBbl 


crylBcl 


crylBdl 


crylBel 














Protoxin 


91.912 


83.890 


11201 


83.565 


93.774 


Toxin 


86.562 


74.922 


74.922 


75.342 


89.220 














Protein 


CrylBal 


CrylBbl 


CrylBcl 


CrylBdl 


CrylBel 














Protoxin 


89.869 


80.193 


75.795 


80.933 


92.170 


Toxin 


82.520 


67.868 


67.868 


70.142 


86.499 
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Table IV: Sequence identities for ay9FalCry9Va.: 



DNA 


cry9Aal 


cry9Bal 


cry9Cal 


cry9Dal 


cry9Eal 














Protoxin 


71.592 


78.212 


76.614 


81.197 


84.043 


Toxin | 


51.782 


62.720 


68.215 


75.593 


81.618 














Protein 


Cry9Aal 


Cry9Bal 


Cry9Cal 


Cry9Dal 


Cry9Eal 














Protoxin 


62.445 


72.064 


71.553 


76.963 


82.578 


Toxin 


35.828 


52.167 


59.133 


68.372 


78.858 















Table V: Sequence identities for cryUd/CvyUd: 



DNA 


crylJal 


crylJbl 


crylJcl 










protoxin 


83.233 


83.176 


86.323 


toxin 


79.526 


81.162 


88.143 










protein 


CrylJal 


CrylJbl 


CrylJcl 










protoxin 


79.759 


78.830 


82.489 


toxin 


71.574 


74.746 


81.711 











Genomic clones of the newly isolated genes have been deposited at the BCCM™-LMBP (Belgian 
Coordinated Collections of Microorganisms - Laboratorium voor Moleculaire Biologie- 
Plasmidencollectie, University of Gent, K.L Ledeganckstraat 35, B-9000 Gent, Belgium) under 
the following accession numbers: 

- LMBP 3983 for E coli JM109 containing plasmid pUC2739C/1Jd1 comprising the cryljd gene, 
deposited on November 25, 1999 (this gene can be isolated from this plasmid on an about 8.4 
kb DNA fragment by digestion with Xhol and Smal); 

- LMBP 3984 for E coli JM109 containing plasmid pUC2739C/9Fa1 comprising the cry9Fa gene, 
deposited on November 25, 1999 (this gene can be isolated from this plasmid on an about 8 kb 
DNA fragment by digestion with Sacl and Pstl); and 

- LMBP 3986 for E coli XLIBlue containing plasmid pUC2072BG/1 Bf1 comprising the crylBf 
gene, deposited on November 25, 1999 (this gene can be isolated from this plasmid on an about 
7 kb DNA fragment by digestion with Sacl and Sail). 
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Example 5: I nsecticidal activity of the cry genes: 



The insert containing the cry9Fa gene was subcloned into a suitable shuttle vector and the 
resulting plasmid pSL2739C/9Fa1 was introduced by routine procedures into a crystal-minus Bt 
strain. The crystal protein produced by a sporulated culture of this recombinant Bt strain was 
tested on neonate larvae of H. virescens and O. nubilalis at a concentration of about 10 9 
particles/ml. On O. nubilalis larvae, 100% mortality was observed, whereas 72% mortality was 
observed on H. virescens lan/ae, whereas after treatment with the crystal-minus control strain all 
larvae survived. 

The insert containing the crylBf gene was subcloned into a suitable shuttle vector and the 
resulting plasmid pSL2072BG/1 Bf was introduced by routine procedures into a crystal-minus 
Bt strain. The crystal protein produced by a sporulated culture of this recombinant Bt strain 
was tested on larvae of Sesamia nonagrioides, Heliothis virescens, Helicoverpa zea and O. 
nubilalis at different concentrations. Significant high mortality of the Cryl Bf toxin was 
observed on H. virescens, Ostrinia nubilialis and Sesamia nonagrioides, while lower toxicity 
was found on Helicoverpa zea. After treatment with the crystal-minus control strain all larvae 
survived. 

The insert containing the cryljd gene was subcloned into a suitable shuttle vector and the 
resulting plasmid pGI2739C/1Jd was introduced by routine procedures into a crystal-minus Bt 
strain. The crystal protein produced by a sporulated culture of this recombinant fit strain is 
tested on larvae of Heliothis virescens at different concentrations, and significant mortality of 
the Cryljd toxin was observed. After treatment with the crystal-minus control strain all larvae 
survived. 

Example 6: production of the novel Cry proteins in transformed plants. 

Chimeric genes encoding the truncated forms of the Cry1 Bf, Cryljd, and Cry9Fa proteins are 
made as described in EP 0 193 259 and published PCT patent application WO 94/12264, 
using the CaMV 35S (Hull and Howell, 1987) and ubiquitin (Christensen et al v 1992) 
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promoters. Preferably, the codon usage of the open reading frame is adapted to that of the 
host plant so as to optimize expression efficiency, as described in published PCT patent 
application WO 94/1 2264. 

Rice, cotton and corn cells are transformed with the resulting chimeric genes. 
Corn cells are stably transformed by either Agrobarterium-med iated transformation (Ishida et 
al., 1996, and U.S. Patent No. 5,591,616) or by electroporation using wounded and enzyme- 
degraded embryogenic callus, as described in WO 92/09696 or US Patent 5,641,664 
(incorporated herein by reference). 

Cotton cells are stably transformed by Agrobacterium-mediated transformation (Umbeck et al., 
1987, Bio/Technology 5, 263-266; US Patent 5,004,863, incorporated herein by reference). 
Rice cells are stably transformed with the method described in published PCT patent 
application WO 92/09696. 

Regenerated transformed corn, cotton and rice plants are selected by ELISA, Northern and 
Southern blot and insecticidal effect. Chimeric gene-containing progeny plants show improved 
resistance to insects compared to untransformed control plants with appropriate segregation of 
insect resistance and the transformed phenotype. Protein and RNA measurements show that 
increased insect resistance is linked with higher expression of the novel Cry protein in the 
plants. 
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SEQUENCE LISTING 



<110> Arnaut, Greta 
Boets, Annemie 
Damme, Nicole 
Mathieu, Eva 
Vanneste, Stijn 
Van Rie, Jeroen 

<120> Insecticidal proteins from Bacillus thuringiensis . 

<130> NEWBTSUS2 

<140> 
<141> 

<150> 60/173387 

<151> 1999-12-28 

<160> 13 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 3687 
<212> DNA 

<213> Bacillus thuringiensis 

<220> 

<221> CDS 

<222> (1) . . (3687) 

<400> 1 

ttg act tea aat agg aaa aat gag aat gaa att ata aat get tta teg 4 8 

Leu Thr Ser Asn Arg Lys Asn Glu Asn Glu lie lie Asn Ala Leu Ser 

1 5 10 15 

att cca get gta teg aat cat tec aca caa atg gat eta tea cca gat 96 
lie Pro Ala Val Ser Asn His Ser Thr Gin Met Asp Leu Ser Pro Asp 
20 25 30 

get cgt att gag gat tct ttg tgt ata gee gag ggg aat aat ate aat 144 
Ala Arg He Glu Asp Ser Leu Cys He Ala Glu Gly Asn Asn He Asn 
35 40 45 

cca ctt gtt age gca tea aca gtc caa acg ggt att aac ata get ggt 192 
Pro Leu Val Ser Ala Ser Thr Val Gin Thr Gly He Asn He Ala Gly 
50 55 60 
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aga ata eta 
Arg lie Leu 
65 

ttt tat agt 
Phe Tyr Ser 



tgg gaa att 
Trp Glu lie 



aca gaa aat 
Thr Glu Asn 
115 

gat tec ttt 
Asp Ser Phe 
130 

cgt gat gat 
Arg Asp Asp 
14 5 

tta gaa ctt 
Leu Glu Leu 



caa gaa gtt 
Gin Glu Val 



eta tta tta 
Leu Leu Leu 
195 

aca teg cag 
Thr Ser Gin 
210 

aga gat tat 
Arg Asp Tyr 
225 

age ttg aga 
Ser Leu Arg 



cgt aga gat 
Arg Arg Asp 



ggt gta tta ggc 
Gly Val Leu Gly 
70 

ttt ctt gtt ggt 
Phe Leu Val Gly 
85 

ttc eta gaa cat 
Phe Leu Glu His 
100 

get agg aat acg 
Ala Arg Asn Thr 



aga gee tat caa 
Arg Ala Tyr Gin 
135 

gca aga acg aga 
Ala Arg Thr Arg 
15 0 

gat ttt ctt aat 
Asp Phe Leu Asn 
165 

cca tta tta atg 
Pro Leu Leu Met 
180 

ttg aga gat gee 
Leu Arg Asp Ala 



gaa att caa cgt 
Glu lie Gin Arg 
215 

tec gac tat tgc 
Ser Asp Tyr Cys 
230 

ggg aca aat gee 
Gly Thr Asn Ala 
245 

eta acg tta ggg 
Leu Thr Leu Gly 
260 



gta ccg ttt 
Val Pro Phe 



gaa tta tgg 
Glu Leu Trp 
90 

gtc gaa caa 
Val Glu Gin 
105 

gca ctt get 
Ala Leu Ala 
120 

cag tea ctt 
Gin Ser Leu 



agt gtt ctt 
Ser Val Leu 



gcg atg ccg 
Ala Met Pro 
170 

gta tat get 
Val Tyr Ala 
185 

tct ctt ttt 
Ser Leu Phe 
200 

tat tat gag 
Tyr Tyr Glu 



gta gaa tgg 
Val Glu Trp 



gca agt tgg 
Ala Ser Trp 
250 

gta tta gat 
Val Leu Asp 
265 



get gga caa ata 
Ala Gly Gin He 
75 

ccc cgc ggc aga 
Pro Arg Gly Arg 



ctt ata aat caa 
Leu He Asn Gin 
110 

cga tta caa ggt 
Arg Leu Gin Gly 
125 

gaa gat tgg eta 
Glu Asp Trp Leu 
140 

tat ace caa tat 
Tyr Thr Gin Tyr 
155 

ctt ttc gca att 
Leu Phe Ala He 



caa get gca aat 
Gin Ala Ala Asn 
190 

ggt agt gaa ttt 
Gly Ser Glu Phe 
205 

cgc caa gtg gaa 
Arg Gin Val Glu 
220 

tat aat aca ggt 
Tyr Asn Thr Gly 
235 

gtg cgt tat aat 
Val Arg Tyr Asn 



eta gtg gca eta 
Leu Val Ala Leu 
270 



get agt 240 
Ala Ser 
80 

gat cag 2 88 
Asp Gin 
95 

caa ata 336 
Gin He 



tta gga 3 84 
Leu Gly 



gaa aac 432 
Glu Asn 



ata gee 480 
He Ala 
160 

aga aac 52 8 

Arg Asn 

175 

tta cac 576 
Leu His 



ggg ctt 624 
Gly Leu 



caa acg 672 
Gin Thr 



eta aat 720 
Leu Asn 
240 

caa ttc 768 

Gin Phe 

255 

ttc cca 816 
Phe Pro 



28 



912 



960 



age tat gac act cgc act tat cca ata aat acg agt get cag tta aca 864 
Ser Tyr Asp Thr Arg Thr Tyr Pro lie Asn Thr Ser Ala Gin Leu Thr 
275 280 285 

agg gaa gtt tat aca gac gca att gga gca aca ggg gta aat atg gca 
Arg Glu Val Tyr Thr Asp Ala lie Gly Ala Thr Gly Val Asn Met Ala 
290 295 300 

agt atg aat tgg tat aat aat aat gca cct teg ttt tec get ata gag 
Ser Met Asn Trp Tyr Asn Asn Asn Ala Pro Ser Phe Ser Ala lie Glu 
305 310 315 320 

act gcg gtt ate cga age ccg cat eta ctt gat ttt eta gaa caa ctt 1008 
Thr Ala Val lie Arg Ser Pro His Leu Leu Asp Phe Leu Glu Gin Leu 
325 330 335 

aca att ttt age act tea tea cga tgg agt get act agg cat atg act 1056 
Thr' lie Phe Ser Thr Ser Ser Arg Trp Ser Ala Thr Arg His Met Thr 
340 345 350 

tac tgg egg ggg cac aca att caa tct egg cca ata gga ggc gga tta 1104 
Tyr Trp Arg Gly His Thr lie Gin Ser Arg Pro lie Gly Gly Gly Leu 
355 360 365 

aat acc tea acg cat ggg tct ace aat act tct att aat cct gta aga 1152 
Asn Thr Ser Thr His Gly Ser Thr Asn Thr Ser lie Asn Pro Val Arg 
370 375 380 

tta tea ttc ttc tct cga gac gta tat tgg act gaa tea tat gca gga 1200 
Leu Ser Phe Phe Ser Arg Asp Val Tyr Trp Thr Glu Ser Tyr Ala Gly 
385 390 395 400 

gtg ctt eta tgg gga att tac ctt gaa cct att cat ggt gtc cct act 1248 
Val Leu Leu Trp Gly lie Tyr Leu Glu Pro lie His Gly Val Pro Thr 
405 410 415 

gtt aga ttt aat ttt agg aac cct cag aat act ttt gaa aga ggt act 12 96 
Val Arg Phe Asn Phe Arg Asn Pro Gin Asn Thr Phe Glu Arg Gly Thr 
420 425 430 

get aac tat agt caa ccc tat gag tea cct ggg ctt caa tta aaa gat 1344 
Ala Asn Tyr Ser Gin Pro Tyr Glu Ser Pro Gly Leu Gin Leu Lys Asp 
435 440 445 

tea gaa act gaa tta cca cca gaa aca aca gaa cga cca aat tat gaa 1392 
Ser Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu 
450 455 460 

tea tat agt cat agg tta tct cac ata ggg etc att tea caa tct agg 144 0 
Ser Tyr Ser His Arg Leu Ser His lie Gly Leu lie Ser Gin Ser Arg 
465 470 475 480 
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gtg cat gta cca gta tat tct tgg acg cac cgt agt gca gat cgt aca 14 88 
Val His Val Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr 
485 490 495 

aat acc att agt tea gat age ata aca caa ata cca ttg gta aaa tea 1536 
Asn Thr He Ser Ser Asp Ser He Thr Gin He Pro Leu Val Lys Ser 
500 505 510 

ttc aac ctt aat tea ggt acc tct gta gtc agt ggc cca gga ttt aca 1584 
Phe Asn Leu Asn Ser Gly Thr Ser Val Val Ser Gly Pro Gly Phe Thr 
515 520 525 

gga ggg gat ata ate cga act aac gtt aat ggt agt gta eta agt atg 1632 
Gly Gly Asp He He Arg Thr Asn Val Asn Gly Ser Val Leu Ser Met 
530 535 540 

ggt ctt aat ttt aat aat aca tea tta cag egg tat cgc gtg aga gtt 1680 
Gly Leu Asn Phe Asn Asn Thr Ser Leu Gin Arg Tyr Arg Val Arg Val 
545 550 555 560 

cgt tat get get tct caa aca atg gtc ctg agg gta act gtc gga ggg 172 8 
Arg Tyr Ala Ala Ser Gin Thr Met Val Leu Arg Val Thr Val Gly Gly 
565 570 575 

agt act act ttt gat caa gga ttc cct agt act atg agt gca aat gag 1776 
Ser Thr Thr Phe Asp Gin Gly Phe Pro Ser Thr Met Ser Ala Asn Glu 
580 585 590 

tct ttg aca tct caa tea ttt aga ttt gca gaa ttt cct gta ggt att 1824 
Ser Leu Thr Ser Gin Ser Phe Arg Phe Ala Glu Phe Pro Val Gly He 
595 600 605 

agt gca tct ggc agt caa act get gga ata agt ata agt aat aat gca 1872 
Ser Ala Ser Gly Ser Gin Thr Ala Gly He Ser He Ser Asn Asn Ala 
610 615 620 

ggt aga caa acg ttt cac ttt gat aaa att gaa ttc att cca att act 1920 
Gly Arg Gin Thr Phe His Phe Asp Lys He Glu Phe He Pro He Thr 
625 630 635 640 

gca acc ttc gaa gca gaa tac gat tta gaa agg gcg caa gag gcg gtg 1968 
Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gin Glu Ala Val 
645 650 655 



aat get ctg ttt act aat acg aat cca aga aga ttg aaa aca gat gtg 

Asn Ala Leu Phe Thr Asn Thr Asn Pro Arg Arg Leu Lys Thr Asp Val 
660 665 670 

aca gat tat cat att gat caa gta tec aat tta gtg gcg tgt tta teg 

Thr Asp Tyr His He Asp Gin Val Ser Asn Leu Val Ala Cys Leu Ser 
675 680 685 



2016 



2064 



30 



2304 



2352 



2400 



gat gaa ttc tgc tta gat gaa aag aga gaa tta ctt gag aaa gtg aaa 2112 
Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Leu Glu Lys Val Lys 
690 695 700 

tat gcg aaa cga etc agt gat gaa aga aac tta etc caa gat cca aac 2160 
Tyr Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp Pro Asn 
705 710 715 720 

ttc aca tec ate aat aag caa cca gac ttc ata tct act aat gag caa 2208 
Phe Thr Ser lie Asn Lys Gin Pro Asp Phe lie Ser Thr Asn Glu Gin 
725 730 735 

teg aat ttc aca tct ate cat gaa caa tct gaa cat gga tgg tgg gga 2256 
Ser Asn Phe Thr Ser lie His Glu Gin Ser Glu His Gly Trp Trp Gly 
740 745 750 

agt gag aac att aca ate cag gaa gga aat gac gta ttt aaa gag aat 
Ser Glu Asn lie Thr lie Gin Glu Gly Asn Asp Val Phe Lys Glu Asn 
755 760 765 

tac gtc aca eta ccg ggg act ttt aat gag tgt tat ccg acg tat tta 
Tyr Val Thr Leu Pro Gly Thr Phe Asn Glu Cys Tyr Pro Thr Tyr Leu 
770 775 780 

tat caa aaa ata gga gag teg gaa tta aaa get tat act cgc tac caa 
Tyr Gin Lys He Gly Glu Ser Glu Leu Lys Ala Tyr Thr Arg Tyr Gin 
785 790 795 800 

tta aga ggg tat att gaa gat agt caa gat tta gag ata tat ttg att 244 8 
Leu Arg Gly Tyr He Glu Asp Ser Gin Asp Leu Glu He Tyr Leu He 
805 810 815 

cgt tat aat gcg aaa cat gaa aca ttg gat gtt cca ggt ace gag tec 2496 
Arg Tyr Asn Ala Lys His Glu Thr Leu Asp Val Pro Gly Thr Glu Ser 
820 825 830 

gta tgg ccg ctt tea gtt gaa age cca ate gga agg tgc gga gaa ccg 2544 
Val Trp Pro Leu Ser Val Glu Ser Pro He Gly Arg Cys Gly Glu Pro 
835 840 845 

aat cga tgc gca cca cat ttt gaa tgg aat cct gat eta gat tgt tec 2592 
Asn Arg Cys Ala Pro His Phe Glu Trp Asn Pro Asp Leu Asp Cys Ser 
850 855 860 

tgc aga gat gga gaa aaa tgt gcg cat cat tec cat cat ttc tct ttg 264 0 
Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His His Phe Ser Leu 
865 870 875 880 

gat att gat att gga tgc aca gac ttg cat gag aat eta ggc gtg tgg 2 688 
Asp He Asp He Gly Cys Thr Asp Leu His Glu Asn Leu Gly Val Trp 
885 890 895 



31 



gtg gta ttc aag att aag acg cag gaa ggt cat gca aga eta ggg aat 2736 
Val Val Phe Lys lie Lys Thr Gin Glu Gly His Ala Arg Leu Gly Asn 
900 905 910 

ctg gaa ttt att gaa gag aaa cca tta tta gga gaa gca ctg tct cgt 2784 
Leu Glu Phe lie Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu Ser Arg 
915 920 925 

gtg aag aga gca gag aaa aaa tgg aga gac aaa cgt gaa aaa eta caa 2 832 
Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Gin 
930 935 940 

ttg gaa aca aaa cga gta tat aca gag gca aaa gaa get gtg gat get 2 880 
Leu Glu Thr Lys Arg Val Tyr Thr Glu Ala Lys Glu Ala Val Asp Ala 
945 950 955 960 

tta ttt gta gat tct caa tat aat aga tta caa gcg gat aca aac att 
Leu Phe Val Asp Ser Gin Tyr Asn Arg Leu Gin Ala Asp Thr Asn lie 
965 970 975 

ggc atg att cat gcg gca gat aaa ctt gtt cat cga att cga gag get 
Gly Met He His Ala Ala Asp Lys Leu Val His Arg He Arg Glu Ala 
980 985 990 

tat ctg tea gaa tta tct gtt ate ccg ggt gta aat gcg gaa att ttt 
Tyr Leu Ser Glu Leu Ser Val He Pro Gly Val Asn Ala Glu He Phe 
995 1000 1005 

gaa gaa tta gaa ggt cgc att ate act gca ate tec eta tac gat gcg 
Glu Glu Leu Glu Gly Arg He He Thr Ala He Ser Leu Tyr Asp Ala 
1010 1015 1020 

aga aat gtc gtt aaa aat ggt gat ttt aat aat gga tta gca tgc tgg 
Arg Asn Val Val Lys Asn Gly Asp Phe Asn Asn Gly Leu Ala Cys Trp 
1025 1030 1035 1040 

aat gta aaa ggg cat gta gat gta caa cag age cat cac cgt tct gtc 3168 
Asn Val Lys Gly His Val Asp Val Gin Gin Ser His His Arg Ser Val 
1045 1050 1055 

ctt gtt ate cca gaa tgg gaa gca gaa gtg tea caa gca gtt cgc gtc 3216 
Leu Val He Pro Glu Trp Glu Ala Glu Val Ser Gin Ala Val Arg Val 
1060 1065 1070 

tgt ccg ggg cgt ggc tat ate etc cgt gtc aca gcg tac aaa gag gga 3264 
Cys Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Tyr Lys Glu Gly 
1075 1080 1085 



2928 



2976 



3024 



3072 



3120 



tat gga gag ggt tgt gta acg ate cat gaa ate gag aac aat aca gac 
Tyr Gly Glu Gly Cys Val Thr He His Glu He Glu Asn Asn Thr Asp 
1090 1095 1100 



32 



3312 



gaa eta aaa ttt aaa aac tgt gaa gaa gag gaa gtg tat cca acg gat 3360 
Glu Leu Lys Phe Lys Asn Cys Glu Glu Glu Glu Val Tyr Pro Thr Asp 
1105 1110 1115 1120 



aca gga acg tgt aat gat tat act gca cac caa ggt aca gca gta tgt 
Thr Gly Thr Cys Asn Asp Tyr Thr Ala His Gin Gly Thr Ala Val Cys 
1125 1130 1135 



3408 



3552 



3600 



aat tec cgt aat get gga tat gag gat gca tat gaa gtt gat act aca 3456 
Asn Ser Arg Asn Ala Gly Tyr Glu Asp Ala Tyr Glu Val Asp Thr Thr 
1140 1145 1150 

gca tct gtt aat tac aaa ccg act tat gaa gaa gaa acg tat aca gat 3504 
Ala Ser Val Asn Tyr Lys Pro Thr Tyr Glu Glu Glu Thr Tyr Thr Asp 
1155 1160 1165 

gta cga aga gat aat cat tgt gaa tat gac aga ggg tat gtg aat tat 
Val Arg Arg Asp Asn His Cys Glu Tyr Asp Arg Gly Tyr Val Asn Tyr 
1170 1175 1180 

cca cca eta cca get ggt tat atg aca aaa gaa tta gaa tac ttc cca 
Pro Pro Leu Pro Ala Gly Tyr Met Thr Lys Glu Leu Glu Tyr Phe Pro 
1185 1190 1195 1200 

gaa ace gat aag gta tgg att gag att gga gaa acg gaa ggg aag ttt 3 648 
Glu Thr Asp Lys Val Trp lie Glu lie Gly Glu Thr Glu Gly Lys Phe 
1205 1210 1215 

att gta gac age gtg gaa tta etc ctt atg gag gaa tag 3 687 

lie Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu 
1220 1225 



<210> 2 
<211> 1228 
<212> PRT 

<213> Bacillus thuringiensis 
<400> 2 

Leu Thr Ser Asn Arg Lys Asn Glu Asn Glu lie lie Asn Ala Leu Ser 
1 5 10 15 

lie Pro Ala Val Ser Asn His Ser Thr Gin Met Asp Leu Ser Pro Asp 
20 25 30 

Ala Arg lie Glu Asp Ser Leu Cys lie Ala Glu Gly Asn Asn lie Asn 
35 40 45 

Pro Leu Val Ser Ala Ser Thr Val Gin Thr Gly lie Asn lie Ala Gly 
50 55 60 
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Arg lie Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gin lie Ala Ser 
65 70 75 80 



Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Gin 
85 90 95 

Trp Glu lie Phe Leu Glu His Val Glu Gin Leu lie Asn Gin Gin lie 
100 105 110 

Thr Glu Asn Ala Arg Asn Thr Ala Leu Ala Arg Leu Gin Gly Leu Gly 
115 120 125 

Asp Ser Phe Arg Ala Tyr Gin Gin Ser Leu Glu Asp Trp Leu Glu Asn 
130 135 140 

Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gin Tyr lie Ala 
145 150 155 160 

Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala lie Arg Asn 
165 170 175 

Gin Glu Val Pro Leu Leu Met Val Tyr Ala Gin Ala Ala Asn Leu His 
180 185 190 

Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu 
195 200 205 

Thr Ser Gin Glu lie Gin Arg Tyr Tyr Glu Arg Gin Val Glu Gin Thr 
210 215 220 

Arg Asp Tyr Ser Asp Tyr Cys Val Glu Trp Tyr Asn Thr Gly Leu Asn 
225 230 235 240 

Ser Leu Arg Gly Thr Asn Ala Ala Ser Trp Val Arg Tyr Asn Gin Phe 
245 250 255 

Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro 
260 265 270 

Ser Tyr Asp Thr Arg Thr Tyr Pro lie Asn Thr Ser Ala Gin Leu Thr 
275 280 285 

Arg Glu Val Tyr Thr Asp Ala lie Gly Ala Thr Gly Val Asn Met Ala 
290 295 300 

Ser Met Asn Trp Tyr Asn Asn Asn Ala Pro Ser Phe Ser Ala He Glu 
305 310 315 320 

Thr Ala Val He Arg Ser Pro His Leu Leu Asp Phe Leu Glu Gin Leu 
325 330 335 
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Thr lie Phe Ser Thr Ser Ser Arg Trp Ser Ala Thr Arg His Met Thr 
340 345 350 



Tyr Trp Arg Gly His Thr lie Gin Ser Arg Pro lie Gly Gly Gly Leu 
355 360 365 

Asn Thr Ser Thr His Gly Ser Thr Asn Thr Ser lie Asn Pro Val Arg 
370 375 380 

Leu Ser Phe Phe Ser Arg Asp Val Tyr Trp Thr Glu Ser Tyr Ala Gly 
385 390 395 400 

Val Leu Leu Trp Gly lie Tyr Leu Glu Pro lie His Gly Val Pro Thr 
405 410 415 

Val Arg Phe Asn Phe Arg Asn Pro Gin Asn Thr Phe Glu Arg Gly Thr 
420 425 430 

Ala Asn Tyr Ser Gin Pro Tyr Glu Ser Pro Gly Leu Gin Leu Lys Asp 
435 440 445 

Ser Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu 
450 455 460 

Ser Tyr Ser His Arg Leu Ser His He Gly Leu He Ser Gin Ser Arg 
465 470 475 480 

Val His Val Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr 
485 490 495 

Asn Thr He Ser Ser Asp Ser He Thr Gin He Pro Leu Val Lys Ser 
500 505 510 

Phe Asn Leu Asn Ser Gly Thr Ser Val Val Ser Gly Pro Gly Phe Thr 
515 520 525 

Gly Gly Asp He He Arg Thr Asn Val Asn Gly Ser Val Leu Ser Met 
530 535 540 

Gly Leu Asn Phe Asn Asn Thr Ser Leu Gin Arg Tyr Arg Val Arg Val 
545 550 ' 555 560 

Arg Tyr Ala Ala Ser Gin Thr Met Val Leu Arg Val Thr Val Gly Gly 
565 570 575 

Ser Thr Thr Phe Asp Gin Gly Phe Pro Ser Thr Met Ser Ala Asn Glu 
580 585 590 

Ser Leu Thr Ser Gin Ser Phe Arg Phe Ala Glu Phe Pro Val Gly He 
595 600 605 
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Ser Ala Ser Gly Ser Gin Thr Ala Gly lie Ser lie Ser Asn Asn Ala 
610 615 620 

Gly Arg Gin Thr Phe His Phe Asp Lys lie Glu Phe lie Pro lie Thr 
625 630 635 640 

Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gin Glu Ala Val 
645 650 655 

Asn Ala Leu Phe Thr Asn Thr Asn Pro Arg Arg Leu Lys Thr Asp Val 
660 665 670 

Thr Asp Tyr His lie Asp Gin Val Ser Asn Leu Val Ala Cys Leu Ser 
675 680 685 

Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Leu Glu Lys Val Lys 
690 695 700 

Tyr Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp Pro Asn 
705 710 ~ 715 720 

Phe Thr Ser lie Asn Lys Gin Pro Asp Phe lie Ser Thr Asn Glu Gin 
725 730 735 

Ser Asn Phe Thr Ser He His Glu Gin Ser Glu His Gly Trp Trp Gly 
740 745 750 

Ser Glu Asn He Thr He Gin Glu Gly Asn Asp Val Phe Lys Glu Asn 
755 760 765 

Tyr Val Thr Leu Pro Gly Thr Phe Asn Glu Cys Tyr Pro Thr Tyr Leu 
770 775 780 

Tyr Gin Lys He Gly Glu Ser Glu Leu Lys Ala Tyr Thr Arg Tyr Gin 
785 790 795 800 

Leu Arg Gly Tyr He Glu Asp Ser Gin Asp Leu Glu He Tyr Leu He 
805 810 815 

Arg Tyr Asn Ala Lys His Glu Thr Leu Asp Val Pro Gly Thr Glu Ser 
820 825 830 

Val Trp Pro Leu Ser Val Glu Ser Pro He Gly Arg Cys Gly Glu Pro 
835 840 845 

Asn Arg Cys Ala Pro His Phe Glu Trp Asn Pro Asp Leu Asp Cys Ser 
850 855 860 

Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His His Phe Ser Leu 
865 870 875 880 
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Asp lie Asp lie Gly Cys Thr Asp Leu His Glu Asn Leu Gly Val Trp 
885 890 895 



Val Val Phe Lys lie Lys Thr Gin Glu Gly His Ala Arg Leu Gly Asn 
900 905 910 

Leu Glu Phe lie Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu Ser Arg 
915 920 925 

Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Gin 
930 935 940 

Leu Glu Thr Lys Arg Val Tyr Thr Glu Ala Lys Glu Ala Val Asp Ala 
945 950 955 960 

Leu Phe Val Asp Ser Gin Tyr Asn Arg Leu Gin Ala Asp Thr Asn He 
965 970 975 

Gly Met He His Ala Ala Asp Lys Leu Val His Arg He Arg Glu Ala 
980 985 990 

Tyr Leu Ser Glu Leu Ser Val He Pro Gly Val Asn Ala Glu He Phe 
995 1000 1005 

Glu Glu Leu Glu Gly Arg He He Thr Ala He Ser Leu Tyr Asp Ala 
1010 1015 1020 

Arg Asn Val Val Lys Asn Gly Asp Phe Asn Asn Gly Leu Ala Cys Trp 
025 1030 1035 1040 

Asn Val Lys Gly His Val Asp Val Gin Gin Ser His His Arg Ser Val 
1045 1050 1055 

Leu Val He Pro Glu Trp Glu Ala Glu Val Ser Gin Ala Val Arg Val 
1060 1065 1070 

Cys Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Tyr Lys Glu Gly 
1075 1080 1085 

Tyr Gly Glu Gly Cys Val Thr He His Glu He Glu Asn Asn Thr Asp 
1090 1095 1100 

Glu Leu Lys Phe Lys Asn Cys Glu Glu Glu Glu Val Tyr Pro Thr Asp 
105 1110 1115 1120 

Thr Gly Thr Cys Asn Asp Tyr Thr Ala His Gin Gly Thr Ala Val Cys 
1125 1130 1135 

Asn Ser Arg Asn Ala Gly Tyr Glu Asp Ala Tyr Glu Val Asp Thr Thr 
1140 1145 1150 

Ala Ser Val Asn Tyr Lys Pro Thr Tyr Glu Glu Glu Thr Tyr Thr Asp 
1155 1160 1165 
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Val Arg Arg Asp Asn His Cys Glu Tyr Asp Arg Gly Tyr Val Asn Tyr 

1170 1175 1180 

Pro Pro Leu Pro Ala Gly Tyr Met Thr Lys Glu Leu Glu Tyr Phe Pro 

185 1190 1195 1200 

Glu Thr Asp Lys Val Trp lie Glu He Gly Glu Thr Glu Gly Lys Phe 

' 1205 1210 1215 

He Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu 

1220 1225 



<210> 3 
<211> 3507 
<212> DNA 

<213> Bacillus thuringiensis 

<220> 

<221> CDS 

<222> (1) . . (3507) 

<400> 3 

atg gag ata aat aat cag aac caa tgc at a cca tat aat tgc tta agt 48 

Met Glu He Asn Asn Gin Asn Gin Cys He Pro Tyr Asn Cys Leu Ser 

1 5 10 15 

aag cct gag gaa gta ttt ttg gat ggg gag agg ata tta cct gat ate 96 
Lys Pro Glu Glu Val Phe Leu Asp Gly Glu Arg He Leu Pro Asp He 
20 25 30 

gat cca etc gaa gtt tct ttg teg ctt ttg caa ttt ctt ttg aat aac 144 
Asp Pro Leu Glu Val Ser Leu Ser Leu Leu Gin Phe Leu Leu Asn Asn 
35 40 45 

ttt gtt ccg ggg ggg ggg ttt att tea gga tta att gac aaa ata tgg 192 
Phe Val Pro Gly Gly Gly Phe He Ser Gly Leu He Asp Lys He Trp 
50 55 60 

ggg get ttg aga cca tct gaa tgg gaa tta ttt ctt gca cag att gaa 240 
Gly Ala Leu Arg Pro Ser Glu Trp Glu Leu Phe Leu Ala Gin He Glu 
65 70 75 80 

cag ttg att gat cga aga ata gaa gca aca gta aga gca aaa gca ate 2 88 
Gin Leu He Asp Arg Arg He Glu Ala Thr Val Arg Ala Lys Ala He 
85 90 95 

get gaa tta gaa ggt tta ggg aga agt tat caa eta tat gga gag gca 3 36 
Ala Glu Leu Glu Gly Leu Gly Arg Ser Tyr Gin Leu Tyr Gly Glu Ala 
100 105 110 
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ttt aaa gag tgg gaa aaa act cca gat aac aca gcg get egg tct aga 3 84 
Phe Lys Glu Trp Glu Lys Thr Pro Asp Asn Thr Ala Ala Arg Ser Arg 
115 120 125 

gta act gag aga ttt cgt ata att gat get caa att gaa gca aat ate 4 32 
Val Thr Glu Arg Phe Arg lie lie Asp Ala Gin lie Glu Ala Asn lie 
130 135 140 

cct teg ttt egg gtt tec gga ttt gaa gtg cca ctt eta teg gtt tat 4 80 
Pro Ser Phe Arg Val Ser Gly Phe Glu Val Pro Leu Leu Ser Val Tyr 
145 150 155 160 

ace caa gca get aat ttg cat etc get eta tta aga gat tct gtt att 52 8 
Thr Gin Ala Ala Asn Leu His. Leu Ala Leu Leu Arg Asp Ser Val lie 
165 170 175 



ttt gga gag aga tgg gga ttg teg act aca aat gtc aat gat ate tat 
Phe Gly Glu Arg Trp Gly Leu Ser Thr Thr Asn Val Asn Asp lie Tyr 
180 185 190 



576 



720 



768 



aat aga caa gtt aag aga att cat gaa tat age gat cat tgt gta gat 624 
Asn Arg Gin Val Lys Arg lie His Glu Tyr Ser Asp His Cys Val Asp 
195 200 205 

acg tat aaa aca gaa tta gaa cgt eta gag ttt aga tct ata gcg caa 672 
Thr Tyr Lys Thr Glu Leu Glu Arg Leu Glu Phe Arg Ser lie Ala Gin 
210 215 220 

tgg aga ata tat aat cag ttt aga agg gaa ttg aca eta acg gta tta 
Trp Arg lie Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val Leu 
225 230 235 240 

gat att gtc get ctt ttc ccg aac tat gat ggt aga ctg tat ccg att 
Asp He Val Ala Leu Phe Pro Asn Tyr Asp Gly Arg Leu Tyr Pro He 
245 250 255 

cga aca att tct caa ttg aca aga gaa att tat aca tec cca gta age 816 
Arg Thr He Ser Gin Leu Thr Arg Glu He Tyr Thr Ser Pro Val Ser 
260 265 270 

gaa ttt tat tat ggt ccc att tat aat cat aat atg att ggt ace ttt ,864 
Glu Phe Tyr Tyr Gly Pro He Tyr Asn His Asn Met He Gly Thr Phe 
275 280 285 

att gaa egg cag eta agg cga cca cat ctt atg gac ttc ttt aac tec 912 
He Glu Arg Gin Leu Arg Arg Pro His Leu Met Asp Phe Phe Asn Ser 
290 295 300 

atg ace atg tat aca tea gat aat aga egg gaa tat tat tgg tea gga 960 
Met Thr Met Tyr Thr Ser Asp Asn Arg Arg Glu Tyr Tyr Trp Ser Gly 
305 * 310 315 320 

ctt gaa atg acg get act ctt act tea gga aat caa gtg tea ttc cct 1008 
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Leu Glu Met Thr Ala Thr Leu Thr Ser Gly Asn Gin Val Ser Phe Pro 
325 330 335 

tta gcc ggg act aga ggg aat tea get cca cca gta tct gtt aga aag 1056 
Leu Ala Gly Thr Arg Gly Asn Ser Ala Pro Pro Val Ser Val Arg Lys 
340 345 350 

act ggt gag gga att tat aga ata tta teg gaa cca ttt tat tea gca 1104 
Thr Gly Glu Gly lie Tyr Arg lie Leu Ser Glu Pro Phe Tyr Ser Ala 
355 360 365 

cct ttt eta ggc ace agt gtg eta gga agt cgc ggg gaa gaa ttt get 1152 
Pro Phe Leu Gly Thr Ser Val Leu Gly Ser Arg Gly Glu Glu Phe Ala 
370 375 380 

ttt gca tct aat act act aca agt ctg cca tct aca ata tat aga aat 1200 
Phe Ala Ser Asn Thr Thr Thr Ser Leu Pro Ser Thr lie Tyr Arg Asn 
385 390 395 400 

cgt gga aca gta gat tea tta gtc age ata ccg cca caa gat tat age 1248 
Arg Gly Thr Val Asp Ser Leu Val Ser He Pro Pro Gin Asp Tyr Ser 
405 410 415 

gta cca ccg cac agg ggg tat agt cat tta tta agt cac gtt acg atg 12 96 
Val Pro Pro His Arg Gly Tyr Ser His Leu Leu Ser His Val Thr Met 
420 425 430 

cac aat agt tct cct ata ttc cac tgg acg cat cgt agt gcg aca cct 1344 
His Asn Ser Ser Pro He Phe His Trp Thr His Arg Ser Ala Thr Pro 
435 440 445 

aga aat ata att tat cca gat agt ate act caa ate cca gta gtt aag 1392 
Arg Asn He He Tyr Pro Asp Ser He Thr Gin He Pro Val Val Lys 
450 455 460 

get teg cac etc tct ggt ggt tea gtt ata aaa gga cct gga cat aca 1440 
Ala Ser His Leu Ser Gly Gly Ser Val He Lys Gly Pro Gly His Thr 
465 470 475 480 

ggt gga gat tta ata age eta cct gta aat aac ttt act cat ttc cga 14 88 
Gly Gly Asp Leu He Ser Leu Pro Val Asn Asn Phe Thr His Phe Arg 
485 490 495 

ate cca ttt cag gca aac act cca caa agg tat cgt att aga att tgt 153 6 
He Pro Phe Gin Ala Asn Thr Pro Gin Arg Tyr Arg He Arg He Cys 
500 505 510 

tat gcg gca gac tea gat ggg act ttg gat agt gga gtt ttc tta agt 15 84 
Tyr Ala Ala Asp Ser Asp Gly Thr Leu Asp Ser Gly Val Phe Leu Ser 
515 520 525 
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gca gca gca ggg gat ggt ttt aat aca act tct tat agg gcc aca atg 1632 

Ala Ala Ala Gly Asp Gly Phe Asn Thr Thr Ser Tyr Arg Ala Thr Met 
530 535 540 

age cct gaa ggt tec tta aca tct cgt gat ttt caa ttt tta gat tta 1680 

Ser Pro Glu Gly Ser Leu Thr Ser Arg Asp Phe Gin Phe Leu Asp Leu 
545 550 555 560 

aac aca teg ttt acc tec gat gta gca tct aac tta tgg tta cat ttt 172 8 

Asn Thr Ser Phe Thr Ser Asp Val Ala Ser Asn Leu Trp Leu His Phe 

565 570 575 

ata cgt tat ata cga cca ggg aat ttg tat ata gat aga gcg gaa ttt 1776 

lie Arg Tyr lie Arg Pro Gly Asn Leu Tyr He Asp Arg Ala Glu Phe 
580 585 590 

ate cca gtg gat gca acc ttc gag gca ggt tat aat tta gaa agg gcg 1824 

He Pro Val Asp Ala Thr Phe Glu Ala Gly Tyr Asn Leu Glu Arg Ala 
595 600 605 

caa aag gcg gtg aat gcc ctg ttt act tct aca aac caa aaa gga tta 1872 

Gin Lys Ala Val Asn Ala Leu Phe Thr Ser Thr Asn Gin Lys Gly Leu 
610 615 620 

caa aca gat gtg acg gat tat cat att gat caa gta tec aat eta gtt 192 0 

Gin Thr Asp Val Thr Asp Tyr His He Asp Gin Val Ser Asn Leu Val 
625 630 635 640 

gat tgt tta tct gat gag ttt tgc tta gat gaa aag cga aaa ttg tec 1968 

Asp Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys Arg Lys Leu Ser 

645 650 655 

gag aaa gtc aaa cag gcg aag cga etc agt gat gag egg aat tta ctt 2 016 

Glu Lys Val Lys Gin Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu 
660 665 670 

cag gat tea aat ttc aga ggc ate aat agg gaa caa gac cgt gga tgg 2064 
Gin Asp Ser Asn Phe Arg Gly He Asn Arg Glu Gin Asp Arg Gly Trp 
675 680 685 

aga gga agt acg gat att act ate caa gga gga aac gat gtt ttc aaa 2112 
Arg Gly Ser Thr Asp He Thr He Gin Gly Gly Asn Asp Val Phe Lys 
690 695 700 

gag aat tac gtt aca eta cca ggt acc ttt gat gcg tgt tat cca aca 2160 

Glu Asn Tyr Val Thr Leu Pro Gly Thr Phe Asp Ala Cys Tyr Pro Thr 
705 710 715 720 

tat ttg tat caa aaa gtc gat gaa tea aaa tta aaa gcc tat acc cgt 2208 
Tyr Leu Tyr Gin Lys Val Asp Glu Ser Lys Leu Lys Ala Tyr Thr Arg 

725 730 735 
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tat caa tta aga ggc tat ate gaa gat agt caa gat tta gaa gtc tat 2256 
Tyr Gin Leu Arg Gly Tyr lie Glu Asp Ser Gin Asp Leu Glu Val Tyr 
740 745 750 

ttg ate cgt tat aat gca aaa tat gaa acg tta aat gtg cca ggt atg 2 3 04 
Leu lie Arg Tyr Asn Ala Lys Tyr Glu Thr Leu Asn Val Pro Gly Met 
755 760 765 

ggt tct tta tgg cca ctt tea gtc gaa agt cca att gga aag tgt gga 2 352 
Gly Ser Leu Trp Pro Leu Ser Val Glu Ser Pro lie Gly Lys Cys Gly 
770 775 780 

gaa ccg aat cga tgt gtg cca caa ctt gaa tgg aat ccc gat ttc gat 2 400 
Glu Pro Asn Arg Cys Val Pro Gin Leu Glu Trp Asn Pro Asp Phe Asp 
785 790 795 800 

tgt tec tgc aga gac gga gaa aaa tgt gcg cat cat teg cat cat ttc 2448 
Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His His Phe 
805 810 815 

tec ttg gac att gat gtt gga tgt aca gac ttg aat gag aac eta ggt 2 4 96 
Ser Leu Asp lie Asp Val Gly Cys Thr Asp Leu Asn Glu Asn Leu Gly 
820 825 830 

ata tgg gtg ata ttc aag att aag aca cag gat ggt cat gca aga eta 2 544 
lie Trp Val He Phe Lys He Lys Thr Gin Asp Gly His Ala Arg Leu 
835 840 845 

gga aat eta gaa ttt etc gaa gag aaa ccg tta tta gga gaa gcg tta 2592 
Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu 
850 855 860 

gec cgt gtg aag aga gcg gag aaa aaa tgg aga gac aaa cgt gaa ata 2 640 
Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu He 
865 870 875 880 

ttg caa tea gaa act aat att gtt tat aaa gaa gca aaa gaa get gta 2688 
Leu Gin Ser Glu Thr Asn He Val Tyr Lys Glu Ala Lys Glu Ala Val 
885 890 895 

gat ggt tta ttc gta gat tec caa tat gag aga tta caa tct gat acg 2736 
Asp Gly Leu Phe Val Asp Ser Gin Tyr Glu Arg Leu Gin Ser Asp Thr 
900 905 910 



aat ate gee atg att cat gcg gca gat aaa.cgc gtt cac cga ate cga 

Asn He Ala Met He His Ala Ala Asp Lys Arg Val His Arg He Arg 
915 920 925 

gag get tat ctg cca gag etc tec gtg att cca ggt gtc aat gca gcg 

Glu Ala Tyr Leu Pro Glu Leu Ser Val He Pro Gly Val Asn Ala Ala 
930 935 940 
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2784 



2832 



att ttc gaa gaa tta gaa gga cgt att ttc aca gcc tac tct eta tat 2880 
He Phe Glu Glu Leu Glu Gly Arg He Phe Thr Ala Tyr Ser Leu Tyr 
945 950 955 960 



gca tat gaa atg aat act aca aca tct gtt aat tac aaa ccg act tac 
Ala Tyr Glu Met Asn Thr Thr Thr Ser Val Asn Tyr Lys Pro Thr Tyr 
1090 1095 1100 

gag gaa gaa gta tat aca gat gga cga aga gat aat cct tgt gaa atg 
Glu Glu Glu Val Tyr Thr Asp Gly Arg Arg Asp Asn Pro Cys Glu Met 
1105 * 1110 1H5 1120 

gaa aga ggt tac aca cca tta cca.gtt ggt tat gta aca aaa gaa tta 
Glu Arg Gly Tyr Thr Pro Leu Pro Val Gly Tyr Val Thr Lys Glu Leu 
1125 H30 H35 

gaa tac ttc cct gaa aca aat aca gta tgg att gaa att gga gaa acg 
Glu Tyr Phe Pro Glu Thr Asn Thr Val Trp He Glu He Gly Glu Thr 
1140 H45 H50 



2976 



3024 



3072 



gat gcg aga aat gtc att aaa aac ggt gat ttc aat aat ggc tta tea 2 92 8 
Asp Ala Arg Asn Val He Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser 
965 970 975 

tgc tgg aat gtg aaa ggg cat gta gat ata aaa cag aat ggt cat cgt 
Cys Trp Asn Val Lys Gly His Val Asp He Lys Gin Asn Gly His Arg 
980 985 990 

tct gtt ctt gtt ate cca gaa tgg gaa gca caa gtt tea caa gaa gtt 
Ser Val Leu Val He Pro Glu Trp Glu Ala Gin Val Ser Gin Glu Val 
995 1000 1005 

cgt gta tgt cca ggt cgt ggc tac ate ctt cgt gtt aca gcg aac aaa 
Arg Val Cys Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Asn Lys 
1010 1015 1020 

gaa gga tat gga gaa ggc tgc gta acg att cat gag att gag aat cat 312 0 
Glu Gly Tyr Gly Glu Gly Cys Val Thr He His Glu He Glu Asn His 
1025 " 1030 ~ 1035 1040 

act gaa aaa ctg aag ttt aga aac tgc gaa gaa gag gaa gta tat cca 3168 
Thr Glu Lys Leu Lys Phe Arg Asn Cys Glu Glu Glu Glu Val Tyr Pro 
1045 1050 1055 

aac aat aca gga acg tgt aat gat tat act gca cac caa caa ggt aca 3216 
Asn Asn Thr Gly Thr Cys Asn Asp Tyr Thr Ala His Gin Gin Gly Thr 
1060 1065 1070 

gca gga tgc gca gat gca tgt aat ace cgt aat gtc gga tat gaa gat 3264 
Ala Gly Cys Ala Asp Ala Cys Asn Thr Arg Asn Val Gly Tyr Glu Asp 
1075 1080 1085 



3312 



3360 



3408 



3456 
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gaa ggg aag ttt att gta gac agt gtc gaa tta etc ctt atg gaa gaa 3504 
Glu Gly Lys Phe He Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu 
1155 1160 H65 

3507 

tag 



<210> 4 
<211> 1168 
<212> PRT 

<213> Bacillus thuringiensis 
<400> 4 

Met Glu He Asn Asn Gin Asn Gin Cys He Pro Tyr Asn Cys Leu Ser 
15 10 15 

Lys Pro Glu Glu Val Phe Leu Asp Gly Glu Arg He Leu Pro Asp He 
2 0 25 30 

Asp Pro Leu Glu Val Ser Leu Ser Leu Leu Gin Phe Leu Leu Asn Asn 
35 40 45 

Phe Val Pro Gly Gly Gly Phe He Ser Gly Leu He Asp Lys He Trp 
50 55 60 

Gly Ala Leu Arg Pro Ser Glu Trp Glu Leu Phe Leu Ala Gin He Glu 
65 ~ 70 75 80 

Gin Leu He Asp Arg Arg He Glu Ala Thr Val Arg Ala Lys Ala He 
85 90 95 

Ala Glu Leu Glu Gly Leu Gly Arg Ser Tyr Gin Leu Tyr Gly Glu Ala 
100 105 HO 

Phe Lys Glu Trp Glu Lys Thr Pro Asp Asn Thr Ala Ala Arg Ser Arg 
115 ~ 120 125 

Val Thr Glu Arg Phe Arg He He Asp Ala Gin He Glu Ala Asn He 
130 135 140 

Pro Ser Phe Arg Val Ser Gly Phe Glu Val Pro Leu Leu Ser Val Tyr 
145 150 155 160 

Thr Gin Ala Ala Asn Leu His Leu Ala Leu Leu Arg Asp Ser Val He 
165 170 175 

Phe Gly Glu Arg Trp Gly Leu Ser Thr Thr Asn Val Asn Asp He Tyr 
180 ' 185 190 

Asn Arg Gin Val Lys Arg He His Glu Tyr Ser Asp His Cys Val Asp 
195 200 205 
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Thr Tyr Lys Thr Glu Leu Glu Arg Leu Glu Phe Arg Ser lie Ala Gin 
210 215 220 

Trp Arg lie Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val Leu 
225 230 235 240 

Asp lie Val Ala Leu Phe Pro Asn Tyr Asp Gly Arg Leu Tyr Pro lie 
245 250 255 

Arg Thr lie Ser Gin Leu Thr Arg Glu He Tyr Thr Ser Pro Val Ser 
260 265 270 

Glu Phe Tyr Tyr Gly Pro He Tyr Asn His Asn Met He Gly Thr Phe 
275 280 285 

He Glu Arg Gin Leu Arg Arg Pro His Leu Met Asp Phe Phe Asn Ser 
290 295 300 

Met Thr Met Tyr Thr Ser Asp Asn Arg Arg Glu Tyr Tyr Trp Ser Gly 
305 310 315 320 

Leu Glu Met Thr Ala Thr Leu Thr Ser Gly Asn Gin Val Ser Phe Pro 
325 330 335 

Leu Ala Gly Thr Arg Gly Asn Ser Ala Pro Pro Val Ser Val Arg Lys 
340 345 350 

Thr Gly Glu Gly He Tyr Arg He Leu Ser Glu Pro Phe Tyr Ser Ala 
355 360 365 

Pro Phe Leu Gly Thr Ser Val Leu Gly Ser Arg Gly Glu Glu Phe Ala 
370 375 380 

Phe Ala Ser Asn Thr Thr Thr Ser Leu Pro Ser Thr He Tyr Arg Asn 
385 390 395 400 

Arg Gly Thr Val Asp Ser Leu Val Ser He Pro Pro Gin Asp Tyr Ser 
405 410 415 

Val Pro Pro His Arg Gly Tyr Ser His Leu Leu Ser His Val Thr Met 
420 425 430 

His Asn Ser Ser Pro lie Phe His Trp Thr His Arg Ser Ala Thr Pro 
435 440 445 

Arg Asn He He Tyr Pro Asp Ser He Thr Gin He Pro Val Val Lys 
450 455 460 

Ala Ser His Leu Ser Gly Gly Ser Val He Lys Gly Pro Gly His Thr 
465 470 475 480 

Gly Gly Asp Leu He Ser Leu Pro Val Asn Asn Phe Thr His Phe Arg 
485 490 495 
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lie Pro Phe Gin Ala Asn Thr Pro Gin Arg Tyr Arg lie Arg lie Cys 
500 505 510 



Tyr Ala Ala Asp Ser Asp Gly Thr Leu Asp Ser Gly Val Phe Leu Ser 
515 520 525 

Ala Ala Ala Gly Asp Gly Phe Asn Thr Thr Ser Tyr Arg Ala Thr Met 
530 535 540 

Ser Pro Glu Gly Ser Leu Thr Ser Arg Asp Phe Gin Phe Leu Asp Leu 
545 550 555 560 

Asn Thr Ser Phe Thr Ser Asp Val Ala Ser Asn Leu Trp Leu His Phe 
565 570 575 

lie Arg Tyr lie Arg Pro Gly Asn Leu Tyr lie Asp Arg Ala Glu Phe 
580 585 590 

lie Pro Val Asp Ala Thr Phe Glu Ala Gly Tyr Asn Leu Glu Arg Ala 
595 600 605 

Gin Lys Ala Val Asn Ala Leu Phe Thr Ser Thr Asn Gin Lys Gly Leu 
610 615 620 

Gin Thr Asp Val Thr Asp Tyr His lie Asp Gin Val Ser Asn Leu Val 
625 630 635 640 

Asp Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys Arg Lys Leu Ser 
645 650 655 

Glu Lys Val Lys Gin Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu 
660 665 670 

Gin Asp Ser Asn Phe Arg Gly lie Asn Arg Glu Gin Asp Arg Gly Trp 
675 680 685 

Arg Gly Ser Thr Asp lie Thr lie Gin Gly Gly Asn Asp Val Phe Lys 
690 695 700 

Glu Asn Tyr Val Thr Leu Pro Gly Thr Phe Asp Ala Cys Tyr Pro Thr 
705 710 715 720 

Tyr Leu Tyr Gin Lys Val Asp Glu Ser Lys Leu Lys Ala Tyr Thr Arg 
725 730 735 

Tyr Gin Leu Arg Gly Tyr lie Glu Asp Ser Gin Asp Leu Glu Val Tyr 
740 745 750 

Leu lie Arg Tyr Asn Ala Lys Tyr Glu Thr Leu Asn Val Pro Gly Met 
755 760 765 
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Gly Ser Leu Trp Pro Leu Ser Val Glu Ser Pro lie Gly Lys Cys Gly 
770 775 780 



Glu Pro Asn Arg Cys Val Pro Gin Leu Glu Trp Asn Pro Asp Phe Asp 
785 790 795 800 

Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His His Phe 
805 810 815 

Ser Leu Asp lie Asp Val Gly Cys Thr Asp Leu Asn Glu Asn Leu Gly 
820 825 830 

He Trp Val He Phe Lys He Lys Thr Gin Asp Gly His Ala Arg Leu 
835 840 845 

Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu 
850 855 860 

Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu He 
865 870 875 880 

Leu Gin Ser Glu Thr Asn He Val Tyr Lys Glu Ala Lys Glu Ala Val 
885 890 895 

Asp Gly Leu Phe Val Asp Ser Gin Tyr Glu Arg Leu Gin Ser Asp Thr 
900 905 910 

Asn He Ala Met He His Ala Ala Asp Lys Arg Val His Arg He Arg 
915 920 925 

Glu Ala Tyr Leu Pro Glu Leu Ser Val He Pro Gly Val Asn Ala Ala 
930 935 940 

He Phe Glu Glu Leu Glu Gly Arg He Phe Thr Ala Tyr Ser Leu Tyr 
945 950 955 960 

Asp Ala Arg Asn Val He Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser 
965 970 975 

Cys Trp Asn Val Lys Gly His Val Asp He Lys Gin Asn Gly His Arg 
980 985 990 

Ser Val Leu Val He Pro Glu Trp Glu Ala Gin Val Ser Gin Glu Val 
995 1000 1005 

Arg Val Cys Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Asn Lys 
1010 1015 1020 

Glu Gly Tyr Gly Glu Gly Cys Val Thr He His Glu He Glu Asn His 
025 1030 1035 1040 

Thr Glu Lys Leu Lys Phe Arg Asn Cys Glu Glu Glu Glu Val Tyr Pro 
1045 1050 1055 
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Asn Asn Thr Gly Thr Cys Asn Asp Tyr Thr Ala His Gin Gin Gly Thr 
1060 1065 1070 



Ala Gly Cys Ala Asp Ala Cys Asn Thr Arg Asn Val Gly Tyr Glu Asp 
1075 1080 1085 

Ala Tyr Glu Met Asn Thr Thr Thr Ser Val Asn Tyr Lys Pro Thr Tyr 
1090 1095 1100 

Glu Glu Glu Val Tyr Thr Asp Gly Arg Arg Asp Asn Pro Cys Glu Met 
105 1110 1H5 1120 

Glu Arg Gly Tyr Thr Pro Leu Pro Val Gly Tyr Val Thr Lys Glu Leu 
1125 1130 1135 

Glu Tyr Phe Pro Glu Thr Asn Thr Val Trp He Glu He Gly Glu Thr 
1140 1145 H50 

Glu Gly Lys Phe He Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu 
1155 1160 H65 



<210> 5 
<211> 3459 
<212> DNA 

<213> Bacillus thuringiensis 

<220> 

<221> CDS 

<222> (1) . . (3459) 



<400> 5 

atg aac cga aat aat caa aat gat tat gaa gtt att gac get tec aat 48 
Met Asn Arg Asn Asn Gin Asn Asp Tyr Glu Val He Asp Ala Ser Asn 
15 10 15 

tgt ggt tgt gcg tea gat gat gtt gtt caa tac cct ttg gca aga gat 96 
Cys Gly Cys Ala Ser Asp Asp Val Val Gin Tyr Pro Leu Ala Arg Asp 
20 25 30 

ccg aat get gta ttc caa aat atg cat tat aaa gat tat ttg caa acg 144 
Pro Asn Ala Val Phe Gin Asn Met His Tyr Lys Asp Tyr Leu Gin Thr 
35 40 45 

tat gat gga gac tat aca ggt tct ttt ata aat cct aac tta tct att 192 
Tyr Asp Gly Asp Tyr Thr Gly Ser Phe He Asn Pro Asn Leu Ser He 
50 55 60 



aat cct aga gat gta ctg caa act gga att aat att gtg gga aga tta 
Asn Pro Arg Asp Val Leu Gin Thr Gly He Asn He Val Gly Arg Leu 
65 70 75 80 



240 



48 



eta gga ttt eta ggt gtt cca ttt get ggt cag tta gtt act ttc tat 288 
Leu Gly Phe Leu Gly Val Pro Phe Ala Gly Gin Leu Val Thr Phe Tyr 
85 90 95 

act ttt ctt tta aat caa eta tgg cca aca aat gat aat gca gta tgg 33 6 
Thr Phe Leu Leu Asn Gin Leu Trp Pro Thr Asn Asp Asn Ala Val Trp 
100 105 110 

gaa get ttt atg gca caa ata gaa gag ctt att aat caa aga ata tec 3 84 
Glu Ala Phe Met Ala Gin lie Glu Glu Leu lie Asn Gin Arg lie Ser 
115 120 125 

gaa gca gta gta ggg aca gca gcg gat cat tta acg gga tta cac gat 4 32 
Glu Ala Val Val Gly Thr Ala Ala Asp His Leu Thr Gly Leu His Asp 
130 135 140 

aat tat gag tta tat gta gag gca ttg gaa gaa tgg ctg gaa aga ccg 4 80 
Asn Tyr Glu Leu Tyr Val Glu Ala Leu Glu Glu Trp Leu Glu Arg Pro 
145 " 150 155 160 

aat get get aga act aat eta ctt ttt aat aga ttt acc ace eta gat 528 
Asn Ala Ala Arg Thr Asn Leu Leu Phe Asn Arg Phe Thr Thr Leu Asp 
165 170 175 

agt ctt ttt aca caa ttt atg cca age ttt ggt act gga cct gga agt 576 
Ser Leu Phe Thr Gin Phe Met Pro Ser Phe Gly Thr Gly Pro Gly Ser 
180 185 190 

caa aac tac gca gtt cca tta ctt aca gta tac gca caa gca gcg aac 624 
Gin Asn Tyr Ala Val Pro Leu Leu Thr Val Tyr Ala Gin Ala Ala Asn 
195 200 205 

ctt cat ttg tta tta tta aag gat get gaa ata tat gga gca aga tgg 672 
Leu His Leu Leu Leu Leu Lys Asp Ala Glu lie Tyr Gly Ala Arg Trp 
210 215 220 

gga ctg aac caa aat cag att aac tea ttc cat acg cgc caa caa gag 720 
Gly Leu Asn Gin Asn Gin lie Asn Ser Phe His Thr Arg Gin Gin Glu 
225 230 235 240 

cgt act caa tat tat aca aat cat tgc gta acg acg tat aat acc ggt 768 
Arg Thr Gin Tyr Tyr Thr Asn His Cys Val Thr Thr Tyr Asn Thr Gly 
245 250 255 

tta gat aga tta aga ggc aca aat act gaa agt tgg tta aat tat cat 816 
Leu Asp Arg Leu Arg Gly Thr Asn Thr Glu Ser Trp Leu Asn Tyr His 
260 265 270 



cga ttc cgt aga gag atg aca tta atg gca atg gat tta gtg gee tta 
Arg Phe Arg Arg Glu Met Thr Leu Met Ala % Met Asp Leu Val Ala Leu 
275 280 285 
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864 



ttc cca tac tat aat gtg cga caa tat cca aat ggg gca aat cca cag 912 
Phe Pro Tyr Tyr Asn Val Arg Gin Tyr Pro Asn Gly Ala Asn Pro Gin 
290 295 300 

ctt aca cgt gaa ata tat acg gat cca ate gta tat aat cca cca get 
Leu Thr Arg Glu lie Tyr Thr Asp Pro He Val Tyr Asn Pro Pro Ala 
305 ~ 310 315 320 

aat cag gga ate tgc cga cgt tgg ggg aat aat cct tat aat aca ttt 
Asn Gin Gly He Cys Arg Arg Trp Gly Asn Asn Pro Tyr Asn Thr Phe 
325 330 335 

tct gaa ctt gaa aat get ttt att cgc ccg cca cat ctt ttt gat agg 
Ser Glu Leu Glu Asn Ala Phe He Arg Pro Pro His Leu Phe Asp Arg 
340 345 350 

ttg aat aga tta act att tct aga aac cga tat aca get cca aca act 
Leu Asn Arg Leu Thr He Ser Arg Asn Arg Tyr Thr Ala Pro Thr Thr 
355 360 365 

aat age tac eta gac tat tgg tea ggt cat act tta caa age cag tat 
Asn Ser Tyr Leu Asp Tyr Trp Ser Gly His Thr Leu Gin Ser Gin Tyr 
370 375 380 

gca aat aac ccg acg aca tat gaa act agt tac ggt cag att acc tct 
Ala Asn Asn Pro Thr Thr Tyr Glu Thr Ser Tyr Gly Gin He Thr Ser 
385 390 395 400 

aac aca cgt tta ttc aat acg act aat gga gec aat gca ata gat tea 
Asn Thr Arg Leu Phe Asn Thr Thr Asn Gly Ala Asn Ala He Asp Ser 
405 410 415 

agg gca aga aat ttt ggt aac tta tac get aat ttg tat ggt gtt age 12 96 
Arg Ala Arg Asn Phe Gly Asn Leu Tyr Ala Asn Leu Tyr Gly Val Ser 
420 425 430 



tat ttg aat att ttc cca aca ggt gtg atg agt gaa ate acc tea gee 
Tyr Leu Asn He Phe Pro Thr Gly Val Met Ser Glu He Thr Ser Ala 
435 440 445 

cct aat acg tgt tgg caa gac ctt act aca act gag gaa eta cca eta 
Pro Asn Thr Cys Trp Gin Asp Leu Thr Thr Thr Glu Glu Leu Pro Leu 
450 455 460 

gtg aat aat aat ttt aat ctt tta tct cat gtt act ttc tta cgc ttc 
Val Asn Asn Asn Phe Asn Leu Leu Ser His Val Thr Phe Leu Arg Phe 
465 470 475 480 

aat act act cag ggt ggc ccc ctt gca act gta ggg ttt gta ccc aca 
Asn Thr Thr Gin Gly Gly Pro Leu Ala Thr Val Gly Phe Val Pro Thr 
485 490 495 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1344 



1392 



1440 



1488 



50 



tat gtg tgg aca cgt caa gat gta gat ttt aat aat ata att act ccc 1536 

Tyr Val Trp Thr Arg Gin Asp Val Asp Phe Asn Asn lie lie Thr Pro 
500 505 510 

aat aga att act caa ata cca gtg gta aag gca tat gag eta agt agt 1584 

Asn Arg lie Thr Gin He Pro Val Val Lys Ala Tyr Glu Leu Ser Ser 

515 520 525 

ggt get act gtc gtg aaa ggt cca gga ttc aca gga gga gat gta ate 1632 

Gly Ala Thr Val Val Lys Gly Pro Gly Phe Thr Gly Gly Asp Val lie 
530 535 540 



cga aga aca aat act ggt gga ttc gga gca ata agg gtg teg gtc act 
Arg Arg Thr Asn Thr Gly Gly Phe Gly Ala He Arg Val Ser Val Thr 
545 550 555 560 



1680 



gga ccg eta aca caa cga tat cgc ata agg ttc cgt tat get teg aca 172 8 
Gly Pro Leu Thr Gin Arg Tyr Arg He Arg Phe Arg Tyr Ala Ser Thr 
565 570 575 

ata gat ttt gat ttc ttt gta aca cgt gga gga act act ata aat aat 1776 
He Asp Phe Asp Phe Phe Val Thr Arg Gly Gly Thr Thr He Asn Asn 
580 585 590 

ttt aga ttt aca cgt aca atg aac agg gga cag gaa tea aga tat gaa 1824 
Phe Arg Phe Thr Arg Thr Met Asn Arg Gly Gin Glu Ser Arg Tyr Glu 
595 600 605 

tec tat cgt act gta gag ttt aca act cct ttt aac ttt aca caa agt 
Ser Tyr Arg Thr Val Glu Phe Thr Thr Pro Phe Asn Phe Thr Gin Ser 
610 615 620 

caa gat ata att cga aca tct ate cag gga ctt agt gga aat ggg gaa 
Gin Asp He He Arg Thr Ser He Gin Gly Leu Ser Gly Asn Gly Glu 
625 630 635 640 

gta tac ctt gat aga att gaa ate ate cct gta aat cca aca cga gaa 
Val Tyr Leu Asp Arg He Glu He He Pro Val Asn Pro Thr Arg Glu 
645 650 655 

gcg gaa gag gat tta gaa gcg gcg aag aaa gcg gtg gcg age ttg ttt 
Ala Glu Glu Asp Leu Glu Ala Ala Lys Lys Ala Val Ala Ser Leu Phe 
660 665 670 

aca cgc aca agg gac gga tta caa gta aat gtg aca gat tat caa gtc 
Thr Arg Thr Arg Asp Gly Leu Gin Val Asn Val Thr Asp Tyr Gin Val 
675 680 685 

gat caa gcg gca aat tta gtg tea tgc tta tea gat gaa caa tat gcg 2112 
Asp Gin Ala Ala Asn Leu Val Ser Cys Leu Ser Asp Glu Gin Tyr Ala 
690 695 700 



51 



1872 



1920 



1968 



2016 



2064 



2208 



2256 



2304 



cat gat aaa aag atg tta ttg gaa gcg gta cgc gcg gca aaa cgc etc 2160 
His Asp Lys Lys Met Leu Leu Glu Ala Val Arg Ala Ala Lys Arg Leu 
705 710 715 720 

age cga gaa cgt aac tta ctt cag gat cca gat ttt aat aca ate aat 
Ser Arg Glu Arg Asn Leu Leu Gin Asp Pro Asp Phe Asn Thr lie Asn 
725 730 735 

agt aca gaa gaa aat gga tgg aaa gca agt aac ggc gtt act att agt 
Ser Thr Glu Glu Asn Gly Trp Lys Ala Ser Asn Gly Val Thr lie Ser 
740 745 750 

gag ggc ggt cca ttc tat aaa ggc cgt gca ctt cag eta gca agt gca 
Glu Gly Gly Pro Phe Tyr Lys Gly Arg Ala Leu Gin Leu Ala Ser Ala 
755 760 765 

cga gaa aat tac cca aca tac ate tat caa aaa gta gat gca teg gag 23 52 
Arg Glu Asn Tyr Pro Thr Tyr lie Tyr Gin Lys Val Asp Ala Ser Glu 
770 775 780 

tta aaa cct tat aca cga tat aga ctg gat ggg ttc gtg aag agt agt 24 00 
Leu Lys Pro Tyr Thr Arg Tyr Arg Leu Asp Gly Phe Val Lys Ser Ser 
785 790 795 800 

caa gat tta gaa att gat etc att cac cat cat aaa gtc cat ctt gtg 
Gin Asp Leu Glu lie Asp Leu lie His His His Lys Val His Leu Val 
805 810 815 

aaa aat gta eta gat aat tta gta tct gat act tac cca gat gat tct 
Lys Asn Val Leu Asp Asn Leu Val Ser Asp Thr Tyr Pro Asp Asp Ser 
820 825 830 

tgt agt gga ate aat cga tgt gag gaa caa cag atg gta aat gcg caa 
Cys Ser Gly He Asn Arg Cys Glu Glu Gin Gin Met Val Asn Ala Gin 
835 840 845 

ctg gaa aca gaa cat cat cat ccg atg gat tgc tgt gaa gca get caa 
Leu Glu Thr Glu His His His Pro Met Asp Cys Cys Glu Ala Ala Gin 
850 855 860 

aca cat gag ttt tct tec tat att gat aca ggg gat tta aat teg act 
Thr His Glu Phe Ser Ser Tyr He Asp Thr Gly Asp Leu Asn Ser Thr 
865 870 875 880 

gta gac cag gga ate tgg gtg ate ttt aaa gtt cga aca aca gat ggt 
Val Asp Gin Gly He Trp Val He Phe Lys Val Arg Thr Thr Asp Gly 
885 890 895 

tat gcg acg eta gga aat ctt gaa ttg gta gag gtc gga ccg tta ttg 2736 
Tyr Ala Thr Leu Gly Asn Leu Glu Leu Val Glu Val Gly Pro Leu Leu 
900 905 910 
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2448 



2496 



2544 



2592 



2640 



2688 



ggt gaa cct eta gaa cgt gaa caa aga gaa aat gcg aaa tgg aat gca 2784 
Gly Glu Pro Leu Glu Arg Glu Gin Arg Glu Asn Ala Lys Trp Asn Ala 
915 920 925 

gag tta gga aga aaa cgt gca gaa aca gat cgc gtg tat caa gat gec 2 832 
Glu Leu Gly Arg Lys Arg Ala Glu Thr Asp Arg Val Tyr Gin Asp Ala 
930 935 940 

aaa caa tec ate aat cat tta ttt gtg gat tat caa gat caa caa tta 2880 
Lys Gin Ser lie Asn His Leu Phe Val Asp Tyr Gin Asp Gin Gin Leu 
945 950 955 960 

aat cca caa ata ggg atg gca gat att atg gac get caa aat ctt gtc 292 8 
Asn Pro Gin lie Gly Met Ala Asp lie Met Asp Ala Gin Asn Leu Val 
965 970 975 

gca tea att tea gat gta tat age gat gca gta ctg caa ate cct gga 2976 
Ala Ser lie Ser Asp Val Tyr Ser Asp Ala Val Leu Gin lie Pro Gly 
980 985 990 

att aac tat gag att tac aca gag ctg tec aat cgc tta caa caa gca 3024 
lie Asn Tyr Glu lie Tyr Thr Glu Leu Ser Asn Arg Leu Gin Gin Ala 
995 1000 1005 

teg tat ctg tat acg tct cga aat gcg gtg caa aat ggg gac ttt aac 3 072 
Ser Tyr Leu Tyr Thr Ser Arg Asn Ala Val Gin Asn Gly Asp Phe Asn 
1010 1015 1020 

aac ggg eta gat age tgg aat gca aca gcg ggt gca teg gta caa cag 3120 
Asn Gly Leu Asp Ser Trp Asn Ala Thr Ala Gly Ala Ser Val Gin Gin 
1025 1030 1035 , 1040 

gat ggc aat acg cat ttc tta gtt ctt tct cat tgg gat gca caa gtt 3168 
Asp Gly Asn Thr His Phe Leu Val Leu Ser His Trp Asp Ala Gin Val 
1045 1050 1055 

tec caa caa ttt aga gtg cag ccg aat tgt aaa tat gta tta cgt gta 3216 
Ser Gin Gin Phe Arg Val Gin Pro Asn Cys Lys Tyr Val Leu Arg Val 
1060 1065 1070 

aca gca gag aaa gta ggc ggc gga gac gga tac gtg act ate egg gat 3264 
Thr Ala Glu Lys Val Gly Gly Gly Asp Gly Tyr Val Thr He Arg Asp 
1075 1080 1085 

ggt get cat cat aca gaa acg ctt aca ttt aat gca tgt gat tat gat 3312 
Gly Ala His His Thr Glu Thr Leu Thr Phe Asn Ala Cys Asp Tyr Asp 
1090 1095 1100 



ata aat ggc acg tac gtg act gat aat acg tat eta aca aaa gaa gtg 
He Asn Gly Thr Tyr Val Thr Asp Asn Thr Tyr Leu Thr Lys Glu Val 
1105 1110 1115 H20 
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3360 



ata ttc tat tea cat aca gaa cac atg tgg gta gag gta aat gaa aca 3408 
lie Phe Tyr Ser His Thr Glu His Met Trp Val Glu Val Asn Glu Thr 
1125 1130 1135 

gaa ggt gca ttt cat ata gat agt att gaa ttc gtt gaa aca gaa aag 3456 
Glu Gly Ala Phe His lie Asp Ser lie Glu Phe Val Glu Thr Glu Lys 
1140 1145 1150 

taa 3459 



<210> 6 
<211> 1152 
<212> PRT 

<213> Bacillus thuringiensis 
<400> 6 

Met Asn Arg Asn Asn Gin Asn Asp Tyr Glu Val He Asp Ala Ser Asn 
15 10 15 

Cys Gly Cys Ala Ser Asp Asp Val Val Gin Tyr Pro Leu Ala Arg Asp 
20 25 30 

Pro Asn Ala Val Phe Gin Asn Met His Tyr Lys Asp Tyr Leu Gin Thr 
35 40 45 

Tyr Asp Gly Asp Tyr Thr Gly Ser Phe He Asn Pro Asn Leu Ser He 
50 55 60 

Asn Pro Arg Asp Val Leu Gin Thr Gly He Asn He Val Gly Arg Leu 
65 70 75 80 

Leu Gly Phe Leu Gly Val Pro Phe Ala Gly Gin Leu Val Thr Phe Tyr 
85 90 95 

Thr Phe Leu Leu Asn Gin Leu Trp Pro Thr Asn Asp Asn Ala Val Trp 
100 105 HO 

Glu Ala Phe Met Ala Gin He Glu Glu Leu He Asn Gin Arg He Ser 
115 120 125 

Glu Ala Val Val Gly Thr Ala Ala Asp His Leu Thr Gly Leu His Asp 
130 135 140 

Asn Tyr Glu Leu Tyr Val Glu Ala Leu Glu Glu Trp Leu Glu Arg Pro 
145 150 155 160 

Asn Ala Ala Arg Thr Asn Leu Leu Phe Asn Arg Phe Thr Thr Leu Asp 
165 170 175 

Ser Leu Phe Thr Gin Phe Met Pro Ser Phe Gly Thr Gly Pro Gly Ser 
180 185 190 
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Gin Asn Tyr Ala Val Pro Leu Leu Thr Val Tyr Ala Gin Ala Ala Asn 
195 200 205 



Leu His Leu Leu Leu 
210 

Gly Leu Asn Gin Asn 
225 

Arg Thr Gin Tyr Tyr 
245 

Leu Asp Arg Leu Arg 
260 

Arg Phe Arg Arg Glu 
275 

Phe Pro Tyr Tyr Asn 
290 

Leu Thr Arg Glu lie 
305 

Asn Gin Gly lie Cys 
325 

Ser Glu Leu Glu Asn 
340 

Leu Asn Arg Leu Thr 
355 

Asn Ser Tyr Leu Asp 
370 

Ala Asn Asn Pro Thr 
385 

Asn Thr Arg Leu Phe 
405 

Arg Ala Arg Asn Phe 
420 

Tyr Leu Asn lie Phe 
435 

Pro Asn Thr Cys Trp 
450 



Leu Lys Asp Ala Glu lie 
215 

Gin lie Asn Ser Phe His 
230 235 

Thr Asn His Cys Val Thr 
250 

Gly Thr Asn Thr Glu Ser 
265 

Met Thr Leu Met Ala Met 
280 

Val Arg Gin Tyr Pro Asn 
295 

Tyr Thr Asp Pro lie Val 
310 315 

Arg Arg Trp Gly Asn Asn 
330 

Ala Phe lie Arg Pro Pro 
345 

lie Ser Arg Asn Arg Tyr 
360 

Tyr Trp Ser Gly His Thr 
375 

Thr Tyr Glu Thr Ser Tyr 
390 * 395 

Asn Thr Thr Asn Gly Ala 
410 

Gly Asn Leu Tyr Ala Asn 
425 

Pro Thr Gly Val Met Ser 
440 

Gin Asp Leu Thr Thr Thr 
455 



Tyr Gly Ala Arg Trp 
220 

Thr Arg Gin Gin Glu 
240 

Thr Tyr Asn Thr Gly 
255 

Trp Leu Asn Tyr His 
270 

Asp Leu Val Ala Leu 
285 

Gly Ala Asn Pro Gin 
300 

Tyr Asn Pro Pro Ala 
320 

Pro Tyr Asn Thr Phe 
335 

His Leu Phe Asp Arg 
350 

Thr Ala Pro Thr Thr 
365 

Leu Gin Ser Gin Tyr 
380 

Gly Gin He Thr Ser 
400 

Asn Ala He Asp Ser 
415 

Leu Tyr Gly Val Ser 
430 

Glu He Thr Ser Ala 
445 

Glu Glu Leu Pro Leu 
460 



55 



Val Asn Asn Asn Phe Asn Leu Leu Ser His Val Thr Phe Leu Arg Phe 
465 470 475 480 

Asn Thr Thr Gin Gly Gly Pro Leu Ala Thr Val Gly Phe Val Pro Thr 
485 490 495 

Tyr Val Trp Thr Arg Gin Asp Val Asp Phe Asn Asn He He Thr Pro 
500 505 510 

Asn Arg He Thr Gin He Pro Val Val Lys Ala Tyr Glu Leu Ser Ser 
515 520 525 

Gly Ala Thr Val Val Lys Gly Pro Gly Phe Thr Gly Gly Asp Val He 
530 535 540 

Arg Arg Thr Asn Thr Gly Gly Phe Gly Ala He Arg Val Ser Val Thr 
545 550 555 560 

Gly Pro Leu Thr Gin Arg Tyr Arg He Arg Phe Arg Tyr Ala Ser Thr 
565 570 575 

He Asp Phe Asp Phe Phe Val Thr Arg Gly Gly Thr Thr He Asn Asn 
580 585 590 

Phe Arg Phe Thr Arg Thr Met Asn Arg Gly Gin Glu Ser Arg Tyr Glu 
595 600 605 

Ser Tyr Arg Thr Val Glu Phe Thr Thr Pro Phe Asn Phe Thr Gin Ser 
610 615 620 

Gin Asp He He Arg Thr Ser He Gin Gly Leu Ser Gly Asn Gly Glu 
625 ^ 630 635 640 

Val Tyr Leu Asp Arg He Glu He He Pro Val Asn Pro Thr Arg Glu 
645 650 655 

Ala Glu Glu Asp Leu Glu Ala Ala Lys Lys Ala Val Ala Ser Leu Phe 
660 665 670 

Thr Arg Thr Arg Asp Gly Leu Gin Val Asn Val Thr Asp Tyr Gin Val 
675 680 685 

Asp Gin Ala Ala Asn Leu Val Ser Cys Leu Ser Asp Glu Gin Tyr Ala 
690 695 700 

His Asp Lys Lys Met Leu Leu Glu Ala Val Arg Ala Ala Lys Arg Leu 
705 * ~ 710 715 720 

Ser Arg Glu Arg Asn Leu Leu Gin Asp Pro Asp Phe Asn Thr He Asn 
725 730 735 

Ser Thr Glu Glu Asn Gly Trp Lys Ala Ser Asn Gly Val Thr He Ser 
740 745 750 
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Glu Gly Gly Pro Phe Tyr Lys Gly Arg Ala Leu Gin Leu Ala Ser Ala 
755 760 765 

Arg Glu Asn Tyr Pro Thr Tyr lie Tyr Gin Lys Val Asp Ala Ser Glu 
770 775 780 

Leu Lys Pro Tyr Thr Arg Tyr Arg Leu Asp Gly Phe Val Lys Ser Ser 
785 790 795 800 

Gin Asp Leu Glu lie Asp Leu lie His His His Lys Val His Leu Val 
805 810 815 

Lys Asn Val Leu Asp Asn Leu Val Ser Asp Thr Tyr Pro Asp Asp Ser 
820 825 830 

Cys Ser Gly lie Asn Arg Cys Glu Glu Gin Gin Met Val Asn Ala Gin 
835 840 845 

Leu Glu Thr Glu His His His Pro Met Asp Cys Cys Glu Ala Ala Gin 
850 855 860 

Thr His Glu Phe Ser Ser Tyr lie Asp Thr Gly Asp Leu Asn Ser Thr 
865 870 875 880 

Val Asp Gin Gly He Trp Val He Phe Lys Val Arg Thr Thr Asp Gly 
885 890 895 

Tyr Ala Thr Leu Gly Asn Leu Glu Leu Val Glu Val Gly Pro Leu Leu 
900 905 910 

Gly Glu Pro Leu Glu Arg Glu Gin Arg Glu Asn Ala Lys Trp Asn Ala 
915 920 925 

Glu Leu Gly Arg Lys Arg Ala Glu Thr Asp Arg Val Tyr Gin Asp Ala 
930 935 940 

Lys Gin Ser lie Asn His Leu Phe Val Asp Tyr Gin Asp Gin Gin Leu 
945 950 955 960 

Asn Pro Gin He Gly Met Ala Asp He Met Asp Ala Gin Asn Leu Val 
965 970 975 

Ala Ser He Ser Asp Val Tyr Ser Asp Ala Val Leu Gin He Pro Gly 
980 985 990 

He Asn Tyr Glu He Tyr Thr Glu Leu Ser Asn Arg Leu Gin Gin Ala 
995 1000 1005 

Ser Tyr Leu Tyr Thr Ser Arg Asn Ala Val Gin Asn Gly Asp Phe Asn 
1010 1015 1020 
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Asn Gly Leu Asp Ser Trp Asn Ala Thr Ala Gly Ala Ser Val Gin Gin 
025 " 1030 1035 1040 

Asp Gly Asn Thr His Phe Leu Val Leu Ser His Trp Asp Ala Gin Val 
1045 . 1050 1055 

Ser Gin Gin Phe Arg Val Gin Pro Asn Cys Lys Tyr Val Leu Arg Val 
1060 1065 1070 

Thr Ala Glu Lys Val Gly Gly Gly Asp Gly Tyr Val Thr lie Arg Asp 
1075 1080 1085 

Gly Ala His His Thr Glu Thr Leu Thr Phe Asn Ala Cys Asp Tyr Asp 
1090 1095 1100 

He Asn Gly Thr Tyr Val Thr Asp Asn Thr Tyr Leu Thr Lys Glu Val 
105 " IHO H15 H20 

He Phe Tyr Ser His Thr Glu His Met Trp Val Glu Val Asn Glu Thr 
1125 H30 H35 

Glu Gly Ala Phe His He Asp Ser He Glu Phe Val Glu Thr Glu Lys 
1140 H45 H50 



<210> 7 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
crylB . f w 

<400> 7 

cagtccaaac gggtataaac 



<210> 8 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer B.R 
<400> 8 

ctgcttcgaa ggttgcagta 



<210> 9 
<211> 20 
<212> DNA 



20 



20 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer B.F 
<400> 9 

cagcgtatta agtcgatgga 



<210> 10 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer JFW 
<400> 10 

gcagctaatg ctaccacatc 



<210> 11 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer JRV 
<400> 11 

gtggcggtat gctgactaat 



<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 9FW 
<400> 12 

gyttttattc gcccgccaca 



<210> 13 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 9RV 
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<400> 13 

cgacagtags acccactact 



